Bacterial promoters and methods of use

ABSTRACT

Compositions and methods are disclosed herein that relate to the development of fusion promoters for regulating gene expression in bacteria. Embodiments include fusion promoters comprising one or more operators linked to a promoter that is modified to have altered activity in Gram-positive organisms. Vectors and cells containing these fusion promoters are also described. Other embodiments include, methods of using these fusion promoters to regulate nucleic acid and/or polypeptide expression, methods of using these fusion promoters to identify proliferation-required genes, and methods of using these fusion promoters to identify molecules having potential antibiotic activity.

RELATED APPLICATIONS

[0001] This application claims priority to U.S. Provisional Patent Application No. 60/259434, filed Dec. 27, 2000, by Robert Haselbeck, and entitled “BACTERIAL PROMOTERS AND METHODS OF USE” and U.S. patent application Ser. No. 09/948,993, filed Sep. 6, 2001, by Daniel Tishkoff, and entitled “RAPID METHOD FOR REGULTING GENE EXPRESSION”, which claims priority to U.S. Provisional Patent Application Serial No. 60/230,335, filed Sep. 6, 2000, by Daniel Tishkoff, and entitled “RAPID REPLACEMENT OF GENOMIC PROMOTERS TO GENERATE STRAINS FOR USE IN A CELL-BASED ASSAY FOR ANTIBIOTICS”, the disclosures of which are incorporated herein by reference in their entirety.

FIELD OF THE INVENTION

[0002] The invention described herein relates to fusion promoters which comprise one or more promoters and one or more operators for directing transcription in Gram-positive bacteria. More specifically, some embodiments relate to fusion promoters comprising a promoter sequence that has been modified so as to alter its transcription activity in Gram-positive organisms linked to a xylose operator, cells having constructs that include such fusion promoters, and methods of using of these manufactures.

BACKGROUND OF THE INVENTION

[0003] Genetic information is expressed in bacteria by a series of steps involving transcription of DNA into mRNA and the subsequent translation of the mRNA into polypeptides or proteins. The expression of encoded information is initiated at a region on the DNA molecule called a “promoter”, onto which RNA polymerase recognizes and, thereby, initiates transcription. Promoters, which usually precede protein-coding nucleotide sequences (“structural genes”), vary in “strength” depending on the ability to recognize and bind RNA polymerase in a manner facilitating the initiation of transcription.

[0004] The regulation of transcription levels from certain promoters in bacteria can be facilitated by “operators” that are positioned adjacent to or within the promoter, Operator sequences generally function by interacting with modulator proteins, which either activate or repress the transcriptional activity of the promoter. Thus, the inherent strength of the promoter and the amount of regulation that can be imposed upon the promoter by an associated operator sequence interacting with the modulator protein contribute significantly to the control of transcription.

[0005] Current techniques in molecular biology enable the transfer to bacteria of exogenous nucleic acids, which one desires to transcribe. Successful transcription of the transferred nucleic acid in the host microorganism, however, depends on the association of the transferred nucleic acid with a suitable promoter and/or operator sequence. Frequently, the promoter and/or operator that is naturally associated with the nucleic acid to be transcribed will not support desired levels of transcription in the new host and, a new promoter and/or operator sequence must be operably linked to the nucleic acid. A variety of promoters and operators have been studied in this regard. (See e.g., von Gabain, et al., Proc. Natl. Acad. Sci. (USA), 76:189-193 (1979); Casadaban, et al., J. Molecular Biology, 138:179-207 (1980); West, et al., Gene, 9:175-193 (1980); (Chang et al., Nature, 275:615 (1978); Itakura et al., Science, 198:1056 (1977); Goeddel et al., Nature, 281:544 (1979); and (Goeddel et al., Nucleic Acids Res., 8:4057 (1980); and EPO Application Publication No. 0036776). However, many of the currently available regulatable promoters are designed for use in Gram-negative bacteria, such as E. coli. There remains a need for regulatable promoters that function in Gram-positive bacteria. In particular, the need exists for promoters that can be used to achieve the precise levels of nucleic acid and/or protein expression desired for applications such as cell-based gene and drug discovery in Gram-positive bacteria. For example, in the discovery of proliferation-required genes, the expression of a proliferation-required mRNA which is transcibed at a low level may be sufficiently reduced by the levels of antisense transcript produced from a promoter having a low maximal level of transcriptional activity to inhibit cell growth. However, the expression of a proliferation-required mRNA which is transcribed at higher levels may only be sufficiently reduced by levels of antisense transcript that can produced from a promoter having a high maximal transcriptional activity. As a result, the number of types of proliferation-required genes that can be discovered are determined in part by the transcriptional activity of the promoter used to express the proliferation-inhibiting antisense RNA. Accordingly, there exists a need for a promoter suite having a wide dynamic range of transcriptional activities in Gram-positive bacteria. In methods in which genes required for proliferation of Gram-positive organisms are identified by transcribing an antisense nucleic acid which is complementary to at least a portion of such genes, the endogenous transcription levels of the genes may vary. Accordingly, it is valuable to have a suite of promoters which provide a wide range of antisense nucleic acid transcription levels to ensure that a sufficient amount of antisense nucleic acid is obtained to allow the identification of proliferation-required genes which are normally expressed at high levels in the Gram-positive organism. In drug discovery applications which utilize Gram-positive cells that are sensitized by the expression of proliferation-inhibiting antisense RNAs, the degree to which the cells are sensitized is an important factor in determining the candidate compounds that are discovered. Furthermore, the expression of some proliferation-required target genes may be so greatly reduced by the production of proliferation-inhibiting antisense RNA from a highly active promoter that, rather than becoming sensitized, the cell dies. Accordingly, there exists a need for a promoter suite containing a collection of promoters which provide a wide range of basal transcription levels, maximal transcription levels, and degrees of induction which can be used to obtain a level of antisense nucleic acid which is optimal for the identification of a particular gene as being required for proliferation or optimal for identifying compounds which act on the gene product of a particular gene which is required for proliferation in cell-based assays using sensitized cells as described herein. Thus, there remains a need for a suite of promoters having the ability to tightly and/or finely regulate transcription in Gram-positive bacteria.

BRIEF SUMMARY OF THE INVENTION

[0006] Some aspects of the present invention are described in the numbered paragraphs below.

[0007] 1. An isolated nucleic acid comprising a fusion promoter said fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter.

[0008] 2. The isolated fusion promoter of claim 1, wherein said at least one promoter is selected from the group consisting of SEQ ID NOs.: 36-45.

[0009] 3. The isolated nucleic acid of claim 1 wherein said at least one operator is xylO.

[0010] 4. The isolated nucleic acid of claim 3, wherein said at least one promoter is T5.

[0011] 5. The isolated nucleic acid of claim 3, further comprising a second operator.

[0012] 6. The isolated nucleic acid of claim 5, wherein said second operator is lacO.

[0013] 7. The isolated nucleic acid of claim 1, wherein said fusion promoter is responsive to an inducer.

[0014] 8. The isolated nucleic acid of claim 7, wherein said inducer is xylose.

[0015] 9. The isolated nucleic acid of claim 1, wherein said fusion promoter is titratable.

[0016] 10. The isolated nucleic acid of claim 1, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.

[0017] 11. The isolated nucleic acid of claim 1, wherein said at least one gram-positive organism is Staphylococcus aureus.

[0018] 12. The isolated nucleic acid of claim 1, wherein said at least one gram-positive organism is Enterococcus faecalis.

[0019] 13. An isolated nucleic acid comprising a fusion promoter said fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter.

[0020] 14. The isolated nucleic acid of claim 13, wherein said at least one operator is xylO.

[0021] 15. The isolated nucleic acid of claim 14, wherein said at least one promoter is T5.

[0022] 16. The isolated nucleic acid of claim 14, further comprising a second operator.

[0023] 17. The isolated nucleic acid of claim 16, wherein said second operator is lacO.

[0024] 18. The isolated nucleic acid of claim 13, wherein said fusion promoter is responsive to an inducer.

[0025] 19. The isolated nucleic acid of claim 18, wherein said inducer is xylose.

[0026] 20. The isolated nucleic acid of claim 13, wherein said fusion promoter is titratable.

[0027] 21. The isolated nucleic acid of claim 13, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.

[0028] 22. The isolated nucleic acid of claim 13, wherein said at least one gram-positive organism is Staphylococcus aureus.

[0029] 23. The isolated nucleic acid of claim 13, wherein said at least one gram-positive organism is Enterococcus faecalis.

[0030] 24. An isolated fusion promoter comprising one of SEQ ID NO.: 26-35.

[0031] 25. A vector comprising the isolated nucleic acid of claim 1.

[0032] 26. A vector comprising the isolated nucleic acid of claim 13.

[0033] 27. A vector comprising the isolated nucleic acid of claim 24.

[0034] 28. The vector of claim 25 further comprising at least one replicon selected from the group consisting of p15a, pC194 and pCT1138.

[0035] 29. The vector of claim 25 further comprising a reporter gene operably linked to said fusion promoter.

[0036] 30. The vector of claim 29, wherein said reporter gene is lacL-lacM.

[0037] 31. The vector of claim 30, wherein lacL-lacM is derived from Leuconostoc mesenteroides.

[0038] 32. The vector of claim 25, wherein said at least one operator is xylO.

[0039] 33. The vector of claim 32, wherein said at least one promoter is T5.

[0040] 34. The vector of claim 32, further comprising a second operator.

[0041] 35. The vector of claim 34, wherein said second operator is lacO.

[0042] 36. The vector of claim 25, wherein said fusion promoter is responsive to an inducer.

[0043] 37. The vector of claim 36, wherein said inducer is xylose.

[0044] 38. The vector of claim 25, wherein said fusion promoter is titratable.

[0045] 39. The vector of claim 25, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.

[0046] 40. The vector of claim 25, wherein said at least one gram-positive organism is Staphylococcus aureus.

[0047] 41. The vector of claim 25, wherein said at least one gram-positive organism is Enterococcus faecalis.

[0048] 42. The vector of claim 25, further comprising a random fragment of a microbial genome operably linked to said fusion promoter.

[0049] 43. The vector of claim 25, further comprising a nucleic acid that encodes a peptide, wherein said nucleic acid is operably linked to said fusion promoter.

[0050] 44. The vector of claim 25, further comprising a nucleic acid that is complementary to a portion of a microbial genome, wherein said nucleic acid is operably linked to said fusion promoter.

[0051] 45. The vector of claim 25, further comprising a nucleic acid that encodes a molecule that inhibits the proliferation of microbe, wherein said nucleic acid is operably linked to said fusion promoter.

[0052] 46. A host cell comprising the nucleic acid of claim 1.

[0053] 47. A host cell comprising the nucleic acid of claim 13.

[0054] 48. A method of expressing a biomolecule in a cell said method comprising:

[0055] (a) introducing a construct into a cell said construct comprising a fusion promoter operably linked to a nucleic acid encoding a biomolecule, said fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; and

[0056] (b) transcribing said nucleic acid encoding a biomolecule from said fusion promoter.

[0057] 49. The method of claim 48, wherein said at least one operator is xylO.

[0058] 50. The method of claim 49, wherein said at least one promoter is T5.

[0059] 51. The method of claim 49, wherein said fusion promoter further comprises a second operator.

[0060] 52. The method of claim 51, wherein said second operator is lacO.

[0061] 53. The method of claim 48, wherein said fusion promoter is responsive to an inducer.

[0062] 54. The method of claim 53, wherein said inducer is xylose.

[0063] 55. The method of claim 48, wherein said fusion promoter is titratable.

[0064] 56. The method of claim 48, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.

[0065] 57. The method of claim 48, wherein said at least one gram-positive organism is Staphylococcus aureus.

[0066] 58. The method of claim 48, wherein said at least one gram-positive organism is Enterococcus faecalis.

[0067] 59. The method of claim 48, wherein said nucleic acid is a random fragment of a microbial genome.

[0068] 60. The method of claim 48, wherein said biomolecule is a peptide or nucleic acid.

[0069] 61. The method of claim 48, wherein said biomolecule is a nucleic acid that is complementary to a portion of a microbial genome.

[0070] 62. The method of claim 48, wherein said biomolecule is a nucleic acid that encodes a molecule that inhibits the proliferation of a microbe.

[0071] 63. A biomolecule produced by the method of claim 48.

[0072] 64. A method for identifying genes involved in cellular proliferation said method comprising the steps of:

[0073] (a) introducing into cells of a cell population a construct comprising an inducible fusion promoter operably linked to a nucleic acid, said fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter;

[0074] (b) inducing transcription of said nucleic acid from said inducible fusion promoter;

[0075] (c) identifying the cells in said cell population whose proliferation is reduced in response to the induction of transcription of said nucleic acid; and

[0076] (d) identifying the gene from a cell identified in step (c) to which at least a portion of said nucleic acid is complementary.

[0077] 65. A method for identifying genes involved in cellular proliferation said method comprising the steps of:

[0078] (a) introducing into cells of a cell population a construct comprising an inducible fusion promoter operably linked to a nucleic acid, said fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter;

[0079] (b) inducing transcription of said nucleic acid from said inducible fusion promoter;

[0080] (c) identifying the cells in said cell population whose proliferation is reduced in response to the induction of transcription of said nucleic acid; and

[0081] (d) identifying the gene from a cell identified in step (c) to which at least a portion of said nucleic acid is complementary.

[0082] 66. The method of claim 64, wherein said nucleic acid is random fragment of a microbial genome.

[0083] 67. The method of claim 64, wherein said nucleic acid encodes an aptamer.

[0084] 68. The method of claim 64, wherein said nucleic acid encodes a transcript that is complementary to a portion of a microbial genome.

[0085] 69. The method of claim 64, wherein said nucleic acid encodes a molecule that inhibits the proliferation of a microbe.

[0086] 70. The method of claim 64, wherein said at least one operator is xylO.

[0087] 71. The method of claim 70, wherein said fusion promoter is T5.

[0088] 72. The method of claim 70, wherein said fusion promoter further comprises a second operator.

[0089] 73. The method of claim 72, wherein said second operator is lacO.

[0090] 74. The method of claim 64, wherein said inducer is xylose.

[0091] 75. The method of claim 64, wherein said fusion promoter is titratable.

[0092] 76. The method of claim 64, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.

[0093] 77. The method of claim 64, wherein said at least one gram-positive organism is Staphylococcus aureus.

[0094] 78. The method of claim 64, wherein said at least one gram-positive organism is Enterococcus faecalis.

[0095] 79. A method for identifying genes involved in cellular proliferation said method comprising the steps of:

[0096] (a) introducing into the genome of a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid, said fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter;

[0097] (b) comparing the proliferation of said cell cultured in the presence of a first concentration of an inducer that induces transcription from said fusion promoter with the proliferation of said cell cultured in the presence of a concentration of said inducer that is less than said first concentration, wherein a difference in proliferation indicates that said fusion promoter modulates a gene that is required for proliferation; and

[0098] (c) identifying the gene that is modulated by said fusion promoter.

[0099] 80. A method for identifying genes involved in cellular proliferation said method comprising the steps of:

[0100] (a) introducing into the genome of a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid, said fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter;

[0101] (b) comparing the proliferation of said cell cultured in the presence of a first concentration of an inducer that induces transcription from said fusion promoter with the proliferation of said cell cultured in the presence of a concentration of said inducer that is less than said first concentration, wherein a difference in proliferation indicates that said fusion promoter modulates a gene that is required for proliferation; and

[0102] (c) identifying the gene that is modulated by said fusion promoter.

[0103] 81. The method of claim 79, wherein said nucleic acid is random fragment of a microbial genome.

[0104] 82. The method of claim 79, wherein said nucleic acid encodes an aptamer.

[0105] 83. The method of claim 79, wherein said nucleic acid encodes a transcript that is complementary to a portion of a microbial genome.

[0106] 84. The method of claim 79, wherein said nucleic acid encodes a molecule that inhibits the proliferation of a microbe.

[0107] 85. The method of claim 79, wherein said at least one operator is xylO.

[0108] 86. The method of claim 85, wherein said at least one promoter is T5.

[0109] 87. The method of claim 85, wherein said fusion promoter further comprises a second operator.

[0110] 88. The method of claim 87 wherein said second operator is lacO.

[0111] 89. The method of claim 79, wherein said inducer is xylose.

[0112] 90. The method of claim 79, wherein said fusion promoter is titratable.

[0113] 91. The method of claim 79, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.

[0114] 92. The method of claim 79, wherein said at least one gram-positive organism is Staphylococcus aureus.

[0115] 93. The method of claim 79, wherein said at least one gram-positive organism is Enterococcus faecalis.

[0116] 94. A method for identifying a compound which reduces the activity or level of a gene product required for proliferation of a cell said method comprising:

[0117] (a) introducing into a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required gene, said fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter;

[0118] (b) sensitizing said cell by inducing transcription from said fusion promoter;

[0119] (c) contacting said sensitized cell with a compound; and

[0120] (d) determining the degree to which said compound inhibits proliferation of said sensitized cell relative to a cell which has not been sensitized.

[0121] 95. A method for identifying a compound which reduces the activity or level of a gene product required for proliferation of a cell said method comprising:

[0122] (a) introducing into a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required gene, said fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter;

[0123] (b) sensitizing said cell by inducing transcription from said fusion promoter;

[0124] (c) contacting said sensitized cell with a compound; and

[0125] (d) determining the degree to which said compound inhibits proliferation of said sensitized cell relative to a cell which has not been sensitized.

[0126] 96. The method of claim 94, wherein said at least one operator is xylO.

[0127] 97. The method of claim 96, wherein said at least one promoter is T5.

[0128] 98. The method of claim 96, wherein said fusion promoter further comprises a second operator.

[0129] 99. The method of claim 98, wherein said second operator is lacO.

[0130] 100. The method of claim 94, wherein said inducer is xylose.

[0131] 101. The method of claim 94, wherein said fusion promoter is titratable.

[0132] 102. The method of claim 94, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.

[0133] 103. The method of claim 94, wherein said at least one gram-positive organism is Staphylococcus aureus.

[0134] 104. The method of claim 94, wherein said at least one gram-positive organism is Enterococcus faecalis.

[0135] 105. A compound identified using the method of claim 94.

[0136] 106. A method for inhibiting the activity or expression of a gene in an operon required for proliferation said method comprising:

[0137] (a) introducing into a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required operon, said fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; and

[0138] (b) inducing transcription from said fusion promoter.

[0139] 107. A method for inhibiting the activity or expression of a gene in an operon required for proliferation said method comprising:

[0140] (a) introducing into a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required operon, said fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; and

[0141] (b) inducing transcription from said fusion promoter.

[0142] 108. The method of claim 106, wherein said at least one operator is xylO.

[0143] 109. The method of claim 108, wherein said at least one promoter is T5.

[0144] 110. The method of claim 108, wherein said fusion promoter further comprises a second operator.

[0145] 111. The method of claim 110, wherein said second operator is lacO.

[0146] 112. The method of claim 106, wherein said inducer is xylose.

[0147] 113. The method of claim 106, wherein said fusion promoter is titratable.

[0148] 114. The method of claim 106, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.

[0149] 115. The method of claim 106, wherein said at least one gram-positive organism is Staphylococcus aureus.

[0150] 116. The method of claim 106, wherein said at least one gram-positive organism is Enterococcus faecalis.

[0151] 117. A method for identifying the biological pathway in which a proliferation-required gene or its gene product lies said method comprising:

[0152] (a) introducing into a test cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required gene, said fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter;

[0153] (b) inhibiting the activity of said proliferation-required gene or gene product in said test cell by inducing transcription from said fusion promoter;

[0154] (c) contacting said test cell with a compound known to inhibit growth or proliferation of a cell, wherein the biological pathway on which said compound acts is known; and

[0155] (d) determining the degree to which said compound inhibits proliferation of said test cell relative to a cell which has not been contacted with said compound.

[0156] 118. A method for identifying the biological pathway in which a proliferation-required gene or its gene product lies said method comprising:

[0157] (a) introducing into a test cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required gene, said fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter;

[0158] (b) inhibiting the activity of said proliferation-required gene or gene product in said test cell by inducing transcription from said fusion promoter;

[0159] (c) contacting said test cell with a compound known to inhibit growth or proliferation of a cell, wherein the biological pathway on which said compound acts is known; and

[0160] (d) determining the degree to which said compound inhibits proliferation of said test cell relative to a cell which has not been contacted with said compound.

[0161] 119. The method of claim 117, wherein said at least one operator is xylO.

[0162] 120. The method of claim 119, wherein said at least one promoter is T5.

[0163] 121. The method of claim 119, wherein said fusion promoter further comprises a second operator.

[0164] 122. The method of claim 121, wherein said second operator is lacO.

[0165] 123. The method of claim 117, wherein said inducer is xylose.

[0166] 124. The method of claim 117, wherein said fusion promoter is titratable.

[0167] 125. The method of claim 117, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.

[0168] 126. The method of claim 117, wherein said at least one gram-positive organism is Staphylococcus aureus.

[0169] 127. The method of claim 117, wherein said at least one gram-positive organism is Enterococcus faecalis.

[0170] 128. A method for determining the biological pathway on which a test compound acts said method comprising:

[0171] (a) introducing into a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required gene, wherein the biological pathway in which said gene or product of said gene lies is known, and wherein said fusion promoter comprises at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter;

[0172] (b) inhibiting the activity of said proliferation-required in said cell by inducing transcription from said fusion promoter;

[0173] (c) contacting said cell with a test compound; and

[0174] (d) determining the degree to which said test compound inhibits proliferation of said cell relative to a cell containing a which does not contain said construct.

[0175] 129. A method for determining the biological pathway on which a test compound acts said method comprising:

[0176] (a) introducing into a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required gene, wherein the biological pathway in which said gene or product of said gene lies is known, and wherein said fusion promoter comprises at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter;

[0177] (b) inhibiting the activity of said proliferation-required in said cell by inducing transcription from said fusion promoter;

[0178] (c) contacting said cell with a test compound; and

[0179] (d) determining the degree to which said test compound inhibits proliferation of said cell relative to a cell containing a which does not contain said construct.

[0180] 130. The method of claim 128, wherein said at least one operator is xylO.

[0181] 131. The method of claim 130, wherein said at least one promoter is T5.

[0182] 132. The method of claim 130, wherein said fusion promoter further comprises a second operator.

[0183] 133. The method of claim 132, wherein said second operator is lacO.

[0184] 134. The method of claim 128, wherein said inducer is xylose.

[0185] 135. The method of claim 128, wherein said fusion promoter is titratable.

[0186] 136. The method of claim 128, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.

[0187] 137. The method of claim 128, wherein said at least one gram-positive organism is Staphylococcus aureus.

[0188] 138. The method of claim 128, wherein said at least one gram-positive organism is Enterococcus faecalis.

[0189] 139. A method of manufacturing an antibiotic comprising the steps of:

[0190] (a) introducing into a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required gene, said fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter;

[0191] (b) sensitizing said cell by inducing transcription from said fusion promoter;

[0192] (c) contacting said sensitized cell with a compound;

[0193] (d) identifying a compound which substantially inhibits the proliferation of said sensitized cell relative to a cell which has not been sensitized; and

[0194] (e) manufacturing the compound so identified.

[0195] 140. A method of manufacturing an antibiotic comprising the steps of:

[0196] (a) introducing into a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required gene, said fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter;

[0197] (b) sensitizing said cell by inducing transcription from said fusion promoter;

[0198] (c) contacting said sensitized cell with a compound;

[0199] (d) identifying a compound which substantially inhibits the proliferation of said sensitized cell relative to a cell which has not been sensitized; and

[0200] (e) manufacturing the compound so identified.

[0201] 141. The method of claim 139, wherein said at least one operator is xylO.

[0202] 142. The method of claim 141, wherein said at least one promoter is T5.

[0203] 143. The method of claim 141, wherein said fusion promoter further comprises a second operator.

[0204] 144. The method of claim 143, wherein said second operator is lacO.

[0205] 145. The method of claim 139, wherein said inducer is xylose.

[0206] 146. The method of claim 139, wherein said fusion promoter is titratable.

[0207] 147. The method of claim 139, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.

[0208] 148. The method of claim 139, wherein said at least one gram-positive organism is Staphylococcus aureus.

[0209] 149. The method of claim 139, wherein said at least one gram-positive organism is Enterococcus faecalis.

[0210] 150. A method for identifying a gene which is required for proliferation of a prokaryotic cell said method comprising the steps of:

[0211] (a) replacing the native promoter of a gene in the chromosome of a prokaryotic cell having an enhanced frequency of homologous recombination with a regulatable fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO, lacO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; and

[0212] (b) identifying cells in which the extent of proliferation of said cell when said fusion promoter is active at a first level is substantially different than the extent of proliferation of said cell when said fusion promoter is active at a second level, said first level being greater than said second level.

[0213] 151. A method for identifying a gene which is required for proliferation of a prokaryotic cell said method comprising the steps of:

[0214] (a) replacing the native promoter of a gene in the chromosome of a prokaryotic cell having an enhanced frequency of homologous recombination with a regulatable fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO, lacO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; and

[0215] (b) identifying cells in which the extent of proliferation of said cell when said fusion promoter is active at a first level is substantially different than the extent of proliferation of said cell when said fusion promoter is active at a second level, said first level being greater than said second level.

[0216] 152. The method of claim 150, wherein said fusion promoter is an inducible promoter.

[0217] 153. The method of claim 150, wherein the step of replacing said native promoter comprises introducing a linear nucleic acid comprising a 5′ portion homologous to a first portion of said native promoter, a 3′ portion homologous to a second portion of said native promoter and said fusion promoter disposed between said 5′ portion and said 3′ portion into said cell such that homologous recombination occurs between said 5′ portion and said first portion of said native promoter and between said 3′ portion and said second portion of said native promoter.

[0218] 154. The method of claim 153, wherein said linear nucleic acid is double stranded.

[0219] 155. The method of claim 153, wherein said linear nucleic acid is single stranded.

[0220] 156. A method for identifying a compound which inhibits the proliferation of a prokaryotic cell said method comprising the steps of:

[0221] (a) replacing the native promoter of a gene in the chromosome of a prokaryotic cell having an enhanced frequency of homologous recombination with a regulatable fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO, lacO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; and

[0222] (b) comparing the extent of proliferation of a first sample of said cell in the presence of said compound to the extent of proliferation of a second sample of said cell in the presence of said compound, wherein said first sample of said cell has a reduced activity of said fusion promoter activity relative the activity of said fusion promoter in said second sample of said cell and wherein said compound inhibits the proliferation of said cell if the extent of proliferation of said first sample of said cell is substantially less than the extent of proliferation of said second sample of said cell.

[0223] 157. A method for identifying a compound which inhibits the proliferation of a prokaryotic cell said method comprising the steps of:

[0224] (a) replacing the native promoter of a gene in the chromosome of a prokaryotic cell having an enhanced frequency of homologous recombination with a regulatable fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO, lacO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; and

[0225] (b) comparing the extent of proliferation of a first sample of said cell in the presence of said compound to the extent of proliferation of a second sample of said cell in the presence of said compound, wherein said first sample of said cell has a reduced activity of said fusion promoter activity relative the activity of said fusion promoter in said second sample of said cell and wherein said compound inhibits the proliferation of said cell if the extent of proliferation of said first sample of said cell is substantially less than the extent of proliferation of said second sample of said cell.

[0226] 158. The method of claim 156, wherein the step of replacing said native promoter comprises introducing a linear nucleic acid comprising a 5′ portion homologous to a first portion of said native promoter, a 3′ portion homologous to a second portion of said native promoter and said fusion promoter disposed between said 5′ portion and said 3′ portion into said cell such that homologous recombination occurs between said 5′ portion and said first portion of said native promoter and between said 3′ portion and said second portion of said native promoter.

[0227] 159. The method of claim 158, wherein said linear nucleic acid is double stranded.

[0228] 160. The method of claim 158, wherein said linear nucleic acid is single stranded.

[0229] 161. A method for identifying a gene which is required for proliferation of a prokaryotic cell said method comprising the steps of:

[0230] (a) introducing at least one operator selected from the group consisting of xylO, tetO, trpO, malO, lacO and λc1O into a prokaryotic cell having an enhanced frequency of homologous recombination such that said at least one operator regulates transcription of a target nucleic acid in the chromosome of said cell; and

[0231] (b) identifying cells in which the extent of proliferation of said cell when said target nucleic acid is transcribed at a first level is substantially different than the extent of proliferation of said cell when said target nucleic acid is transcribed at a second level, said first level being greater than said second level.

[0232] 162. The method of claim 161, wherein the step of introducing said at least one operator comprises introducing a linear nucleic acid comprising a 5′ portion homologous to a first portion of the chromosome of said cell, a 3′ portion homologous to a second portion of the chromosome of said cell and said at least one operator disposed between said 5′ portion and said 3′ portion into said cell such that homologous recombination occurs between the 5′ portion and said first portion of the chromosome of said cell and between the 3′ portion and said second portion of the chromosome of said cell.

[0233] 163. The method of claim 162, wherein said linear nucleic acid is double stranded.

[0234] 164. The method of claim 162, wherein said linear nucleic acid is single stranded.

[0235] 165. A method of identifying a compound which inhibits the proliferation of a prokaryotic cell said method comprising the steps of:

[0236] (a) obtaining a prokaryotic cell in which transcription of a nucleic acid required for proliferation of said cell is regulated by at least one operator which has been introduced into the chromosome of said cell said at least one operator selected from the group consisting of xylO, tetO, trpO, malO, lacO and λc1O;

[0237] (b) sensitizing said cell by growing said cell under conditions in which the level of transcription of said gene is lower than that of a wild type cell;

[0238] (c) contacting said sensitized cell with said compound; and

[0239] (d) determining the degree to which said compound inhibits the growth of said sensitized cell relative to an unsensitized cell.

[0240] 166. The method of claim 165, wherein said cell has an enhanced frequency of homologous recombination.

[0241] 167. The method of claim 165, wherein said linear nucleic acid is single stranded.

[0242] 168. The method of claim 165, wherein said linear nucleic acid is double stranded.

[0243] 169. A method of identifying a nucleic acid sequence having promoter activity in Enterococcus faecalis said method comprising the steps of:

[0244] (a) inserting a candidate nucleic acid into a vector comprising lacL-lacM reporter genes such that said candidate nucleic acid is upstream of the lacL-lacM reporter genes;

[0245] (b) introducing said vector comprising said candidate nucleic acid into Enterococcus faecalis;

[0246] (c) detecting expression of the lacL-lacM reporter genes, wherein expression of said lacL-lacM reporter genes indicates that said candidate nucleic acid sequence has promoter activity.

[0247] 170. The method of claim 169, wherein said detecting step comprises measuring β-galactosidase activity.

[0248] 171. The method of claim 169, wherein said candidate nucleic acid is a promoter modified to increase activity in a gram-positive organism.

[0249] 172. The method of claim 169, wherein said lacL-lacM reporter gene is derived from Leuconostoc mesenteroides.

[0250] 173. The method of claim 169, wherein said vector is pEPEF1.

Definitions

[0251] The following definitions are provided so as to facilitate the understanding of the invention as set out herein.

[0252] By “activity against a gene product” is meant having the ability to inhibit the function or to reduce the level or activity of the gene product in a cell. This includes, but is not limited to, inhibiting the enzymatic activity of the gene product or the ability of the gene product to interact with other biological molecules required for its activity, including inhibiting the gene product's assembly into a multimeric structure.

[0253] By “activity against a protein” is meant having the ability to inhibit the function or to reduce the level or activity of the protein in a cell. This includes, but is not limited to, inhibiting the enzymatic activity of the protein or the ability of the protein to interact with other biological molecules required for its activity, including inhibiting the protein's assembly into a multimeric structure.

[0254] By “activity against a nucleic acid” is meant having the ability to inhibit the function or to reduce the level or activity of the nucleic acid in a cell. This includes, but is not limited to, inhibiting the ability of the nucleic acid interact with other biological molecules required for its activity, including inhibiting the nucleic acid's assembly into a multimeric structure.

[0255] By “activity against a gene” is meant having the ability to inhibit the function or expression of the gene in a cell. This includes, but is not limited to, inhibiting the ability of the gene to interact with other biological molecules required for its activity.

[0256] By “activity against an operon” is meant having the ability to inhibit the function or reduce the level of one or more products of the operon in a cell. This includes, but is not limited to, inhibiting the enzymatic activity of one or more products of the operon or the ability of one or more products of the operon to interact with other biological molecules required for its activity.

[0257] By “antibiotic” is meant an agent which inhibits the proliferation of a cell or microorganism.

[0258] The term “aptamer” as used herein refers to a single-stranded or double-stranded oligodeoxyribonucleotide, oligoribonucleotide or modified derivatives of oligodeoxyribonucleotides or oligoribonucleotides that specifically binds and alters the biological function of a target molecule, which can be a protein, peptide and derivatives thereof.

[0259] By “biological pathway” is meant any discrete cell function or process that is carried out by a gene product or a subset of gene products. Biological pathways include anabolic, catabolic, enzymatic, biochemical and metabolic pathways as well as pathways involved in the production of cellular structures such as cell walls. Biological pathways that are usually required for proliferation of cells or microorganisms include, but are not limited to, cell division, DNA synthesis and replication, RNA synthesis (transcription), protein synthesis (translation), protein processing, protein transport, fatty acid biosynthesis, electron transport chains, cell wall synthesis, cell membrane production, synthesis and maintenance, and the like.

[0260] The term “expression” is defined as the production of a sense or antisense RNA molecule from a gene, gene fragment, genomic fragment, chromosome, operon or portion thereof. Expression can also be used to refer to the process of peptide or polypeptide synthesis.

[0261] An “expression vector” is defined as a vehicle by which a ribonucleic acid (RNA) is transcribed from a nucleic acid carried within the expression vehicle. The expression vector can also contain features that permit translation of a protein product from the transcribed RNA message expressed from the exogenous nucleic acid carried by the expression vector. Accordingly, an expression vector can produce an RNA molecule as its sole product or the expression vector can produce a RNA molecule that is ultimately translated into a protein product.

[0262] By “identifying a compound” is meant to screen one or more compounds in a collection of compounds such as a combinatorial chemical library, a natural product library or other library of chemical compounds or to characterize a single compound by testing the compound in a given assay and determining whether it exhibits the desired activity.

[0263] By “inducer” is meant an agent, solution or environmental condition which, when placed in contact with a cell or microorganism, increases transcription, or inhibitor and/or promoter clearance/fidelity, from a desired promoter.

[0264] By “inhibit activity of a gene or gene product” is meant having the ability to interfere with the function of a gene or gene product in such a way as to decrease expression of the gene, in such a way as to reduce the level or activity of a product of the gene or in such a way as to inhibit the interaction of the gene or gene product with other biological molecules required for its activity. Agents which inhibit the activity of a gene include agents that inhibit transcription of the gene, agents that inhibit processing of the transcript of the gene, agents that reduce the stability of the transcript of the gene, and agents that inhibit translation of the mRNA transcribed from the gene. In microorganisms, agents which inhibit the activity of a gene can act to decrease expression of the operon in which the gene resides or alter the folding or processing of operon RNA so as to reduce the level or activity of the gene product. The gene product can be a non-translated RNA such as ribosomal RNA, a translated RNA (mRNA) or the protein product resulting from translation of the gene mRNA. Of particular utility to the present invention are antisense RNAs that have activities against the operons or genes to which they specifically hybridize.

[0265] As used herein, “lac operator”, “lac operator sequence”, or “laco” is intended to encompass the operator having SED ID NO.: 11 and any other lac operator sequences consistent with the intended uses described herein.

[0266] As used herein, “λc1 operator”, “λc1 operator sequence”, or “λc1O” is intended to encompass all lambda operators to which the λc1 repressor or variants thereof are capable of binding. Such operators include, but are not limited to, λO_(R1) (SEQ ID NO.: 54), λO_(R2) (SEQ ID NO.: 55), λO_(R3) (SEQ ID NO.: 56), λO_(L1) (SEQ ID NO.: 57), λO_(L2) (SEQ ID NO.: 58), λO_(L3) (SEQ ID NO.: 59).

[0267] As used herein, “mal operator”, “mal operator sequence” or “malO” is intended to encompass the two mal operators MalMop (SEQ ID NO.: 60) and MalXop (SEQ ID NO.: 61), as described in Nieto et al., J. Biol. Chem. 272,30860-30865 (1997), the disclosure of which is incorporated herein by reference in its entirety, and any other mal operator sequences consistent with the intended uses described herein.

[0268] As used herein, “nucleic acid” means DNA, RNA, or modified nucleic acids. Thus, the terminology “the nucleic acid of SEQ ID NO: X” or “the nucleic acid comprising the nucleotide sequence” includes both the DNA sequence of SEQ ID NO: X, the complement thereof, and an RNA sequence in which the thymidines in the DNA sequence have been substituted with uridines in the RNA sequence and in which the deoxyribose backbone of the DNA sequence has been substituted with a ribose backbone in the RNA sequence.

[0269] As used herein, “polynucleotide” has the same meaning as nucleic acid.

[0270] As used herein, “proliferation-inhibiting” encompasses instances where the absence or substantial reduction of a gene transcript and/or gene product completely eliminates cell growth as well as instances where the absence of a gene transcript and/or gene product merely reduces cell growth. A proliferation-inhibiting antisense nucleic acid is one that can cause a reduction of a gene transcript and/or gene product that is sufficient to reduce or eliminate the growth or viability of the cell or microorganism.

[0271] As used herein, “proliferation-required” or “required for proliferation” encompasses instances where the absence or substantial reduction of a gene transcript and/or gene product completely eliminates cell growth as well as instances where the absence of a gene transcript and/or gene product merely reduces cell growth. A proliferation-required gene or gene family is one where, in the absence or substantial reduction of a gene transcript and/or gene product, growth or viability of the cell or microorganism is reduced or eliminated.

[0272] As used herein, “sub-lethal” means a concentration of an agent below the concentration required to inhibit all cell growth.

[0273] As used herein, “tetracycline analog” or “Tc analog” is intended to include compounds which are structurally related to tetracycline and which bind to the Tet repressor with a K_(a) of at least about 10⁻⁶ M. Preferably, the tetracycline analog binds with an affinity of about 10⁻⁹ M or greater. Examples of such tetracycline analogs include, but are not limited to, anhydrotetracycline (atc), doxycycline, chlorotetracycline, oxytetracycline and others disclosed by Hlavka and Boothe, “The Tetracyclines,” in Handbook of Experimental Pharmacology 78, R. K. Blackwood et al. (eds.), Springer-Verlag, Berlin, N.Y., 1985; L. A. Mitscher, “The Chemistry of the Tetracycline Antibiotics”, Medicinal Research 9, Dekker, N.Y., 1978; Noyee Development Corporation, “Tetracycline Manufacturing Processes” Chemical Process Reviews, Park Ridge, N.J., 2 volumes, 1969; R. C. Evans, “The Technology of the Tetracyclines”, Biochemical Reference Series 1, Quadrangle Press, New York, 1968; and H. F. Dowling, “Tetracycline”, Antibiotic Monographs, no. 3, Medical Encyclopedia, New York, 1955. Preferred Tc analog for high-level stimulation of transcription is anhydrotetracycline. For use in prokaryotic organisms, a Tc analog can be chosen which has reduced antibiotic activity compared to Tc. Examples of such Tc analogues are anhydrotetracycline, epioxytetracycline and cyanotetracycline.

[0274] As used herein, “Tet repressor” is intended to describe a protein occurring in nature which represses transcription from tet operator sequences in prokaryotic cells in the absence of Tc or Tc analog. Tetracycline repressors of the present invention may be obtained from any class of tetracycline repressor (e.g., TetR(A), TetR(B), TetR(C), TetR(D), TetR(E), TetR(G), TetR(H), TetR(J), TetR(Z), or combinations thereof). The nucleotide sequences of Tet repressors of representative members of the A, B, C, D and E classes are known (e.g., see Waters, S. H. et al. (1983) Nucl. Acids Res 11:6089-6105, Hillen, W. and Schollmeier, K. (1983) Nucl. Acids Res. 11:525-539 and Postle, K. et al. (1984) Nucl. Acids Res. 12:4849-4863, Unger, B. et al. (1984) Gene 31: 103-108, Unger, B. et al. (1984) Nucl Acids Res. 12:7693-7703 and Tovar, K. et al. (1988) Mol. Gen. Genet. 215:76-80, respectively).

[0275] As used herein, “tet operator”, “tet operator sequence” or “tetO” is intended to encompass all classes of tet operator sequences, e.g. (e.g., TetO(A) (SEQ ID NO.: 62), TetO(B) (SEQ ID NO.: 63), TetO(C) (SEQ ID NO.: 64), TetO(D) (SEQ ID NO.: 65), TetO(E) (SEQ ID NO.: 66), TetO(G), TetO(H), TetO(J), and TetO(Z). Nucleotide sequences of these five classes of tet operators are described in Waters, S. H. et al. (1983) cited supra, Hillen, W. and Schollenmeier, K. (1983) cited supra, Stuber, D. and Bujard, H. (1981) Proc. Natl. Acad. Sci. USA 78:167-171, Unger, B. et al. (1984) cited supra and Tovar, K. et al. (1988) cited supra.

[0276] As used herein, “trp operator”, “trp operator sequence” or “trpO” is intended to encompass all classes of trp operator sequences consistent with the intended uses described herein. One example is the trp operator of SEQ ID NO.: 67.

[0277] As used herein, “xyl operator”, “xyl operator sequence” or “xylO” is intended to encompass all classes of xyl operator sequences consistent with the intended uses described herein. One example is the xyl operator of SEQ ID NO.: 68.

BRIEF DESCRIPTION OF THE DRAWINGS

[0278]FIG. 1 is a sequence map showing the Xyl-T5 fusion promoter (SEQ ID NO.: 1) showing the following elements: (1) −45 AT Box; (2) −35 RNA polymerase recognition site; (3) a 17 bp intervening sequence that can be replaced by a second operator; (4) −10 RNA polymerase recognition site; (5) the xyl operator; (6) multiple cloning sequence; (7A) first rrnB terminator; and (7B) second rrnB terminator.

[0279]FIG. 2 is a sequence map showing the Xyl-CP25 fusion promoter (SEQ ID NO.: 2) showing the following elements: (1) a portion of the xylR gene; (2) −35 RNA polymerase recognition site; (3) a 17 bp intervening sequence that can be replaced by a second operator; (4) −10 RNA polymerase recognition site; and (5) the xyl operator. Also shown are recognition sites for the restriction enzymes (a) RsrII and (b) XhoI.

[0280]FIG. 3A illustrates one embodiment of the present promoter replacement method in which a linear double stranded promoter replacement construct comprising a nucleic acid encoding a selectable marker, a transcriptional terminator, and a regulatable fusion promoter is inserted into the chromosome of a cell expressing the RecE and RecT gene products of the Rac prophage or gene products having analogous functions.

[0281]FIG. 3B illustrates one embodiment of the present promoter replacement method in which a linear single stranded promoter replacement construct comprising a 5′ portion homologous to a target sequence, a 3′ portion homologous to a target sequence and a regulatable fusion promoter is inserted into the chromosome of a cell expressing the λ Beta and Gam proteins.

[0282]FIG. 4A illustrates one embodiment of the operator insertion method in which a linear double stranded regulatory element insertion construct comprising a nucleic acid encoding a selectable marker, a transcriptional terminator, and an operator is inserted into the chromosome of cells expressing the RecE and RecT gene products of the Rac prophage or gene products having analogous functions.

[0283]FIG. 4B illustrates one embodiment of the operator insertion method in which a linear single stranded regulatory element insertion construct comprising a 5′ portion homologous to a target sequence, a 3′ portion homologous to a target sequence and an operator is inserted into the chromosome of a cell expressing the λ Beta and Gam proteins.

[0284]FIG. 5 shows a plasmid map of the pXyl-T5 P15a Staphylococcus expression vector.

[0285]FIG. 6 shows a plasmid map for the pEPEF-X series of Enterococcus expression vectors.

[0286]FIG. 7 shows a bar graph depicting the ratio of RNA measured from xylose induced cells versus non-induced cells. RNA measurements were made in real time by RT-PCR and the samples were normalized for loading by using parallel 16S rRNA controls. Samples were prepared and analyzed in duplicate and the averages are plotted.

[0287]FIG. 8 shows a bar graph comparing the induced and uninduced activities of the Xyl-CP25, Xyl-P32, Xyl-P59 and Xyl-P1P2 fusion promoters in Enterococcus faecalis. The activity of each fusion promoter was determined by measuring the β-galactosidase activity produced by the expression of lacL-lacM reporter genes operably linked downstream of each fusion promoter.

[0288]FIG. 9 shows a bar graph comparing the induced and uninduced activities of the Xyl-T5 and Xyl-CP25 fusion promoters in Enterococcus faecalis. A comparison of the induced and uninduced activities of the Xyl-T5 fusion promoter in Staphylococcus aureus and Enterococcus faecalis is also shown. The activity of each fusion promoter was determined by measuring the β-galactosidase activity produced by the expression of lacL-lacM reporter genes operably linked downstream of each fusion promoter.

[0289]FIG. 10 shows a bar graph depicting the activity of the Xyl-P59 fusion promoter over a range of xylose concentrations from 0 to 10%. The activity of Xyl-P59 was determined by measuring the β-galactosidase activity produced by the expression of lacL-lacM reporter genes operably linked downstream of this fusion promoter.

[0290]FIG. 11 shows a bar graph comparing the induced and uninduced activities of the Xyl-P59, Xyl-PL, and modified Xyl-PL (Xyl-PL10) fusion promoters in Enterococcus faecalis. The activity of each fusion promoter was determined by measuring the β-galactosidase activity produced by the expression of lacL-lacM reporter genes operably linked downstream of each fusion promoter.

[0291]FIG. 12 is an IPTG dose response curve in E. coli transformed with an IPTG-inducible plasmid containing either an antisense clone to the E. coli ribosomal protein rplW (AS-rplW) which is required for protein synthesis and essential for cell proliferation, or an antisense clone to the elaD (AS-elaD) gene which is not known to be involved in protein synthesis and which is also essential for proliferation.

[0292]FIG. 13A is a tetracycline dose response curve in E. coli transformed with an IPTG-inducible plasmid containing antisense to rplW (AS-rplW) in the absence (0) or presence of IPTG at concentrations that result in 20% and 50% growth inhibition.

[0293]FIG. 13B is a tetracycline dose response curve in E. coli transformed with an IPTG-inducible plasmid containing antisense to elaD (AS-elaD)in the absence (0) or presence of IPTG at concentrations that result in 20% and 50% growth inhibition.

[0294]FIG. 14 is a graph showing the fold increase in tetracycline sensitivity of E. coli transfected with antisense clones to essential ribosomal proteins L23 (AS-rplW) and L7/L12 and L10 (AS-rplLrplJ). Antisense clones to genes known to not be directly involved in protein synthesis, atpB/E (AS-atpB/E ), visC (AS-visC), elaD (AS-elaD), yohH (AS-yohH), are much less sensitive to tetracycline.

[0295]FIG. 15 illustrates the results of an assay in which Staphylococcus aureus cells expressing an antisense nucleic acid complementary to the gyrB gene encoding the β subunit of gyrase were contacted with several antibiotics whose targets were known.

DETAILED DESCRIPTION OF THE INVENTION

[0296] The invention described herein involves the construction and use of fusion promoters that can be tightly and/or finely regulated (e.g., with a xyl, tet, mal, lac, trp, λc1, or other suitable operators), constructs and cells having these promoters, methods of expression of RNAs, peptides and proteins, methods to identify genes essential for the growth of bacteria, and methods to identify compounds that inhibit bacterial growth.

[0297] Some embodiments of this invention contemplate the preparation of several different bacterial fusion promoters which contain at least one operator associated with a promoter sequence. The at least one operator is positioned such that the binding of at least one repressor to the operator represses transcription from the promoter. The promoter sequence may be a wildtype promoter obtained from a bacterium, virus or other microorganism or it may be a promoter sequence that has been modified to have altered activity in Gram-positive organisms. One embodiment, for example, is a fusion promoter comprising a promoter sequence modified to have altered activity in at least one Gram-positive organism linked to one or more operators including, but not limited to, the xylose operator (“xylO”), the tetracycline operator (“tetO”), the maltose operator (“malO”), lactose operator (“lacO”), the tryptophan operator (“trpO”), and the lambda Cl operator (“λc1O”). These operators can be induced by the following inducers, respectively: xylose or analog thereof, tetracycline or analog thereof, maltose or analog thereof, lactose or analog thereof, tryptophan or analog thereof, and temperature.

[0298] In some embodiments, fusion promoters based on the bacteriophage T5 promoter and the xylose operator are contemplated. The T5 promoter has been shown to be capable of directing the efficient expression of genes primarily in Escherichia coli. (See e.g., Gentz et al, Proc. Natl. Acad. Sci. USA 78:4936-4940 (1981); Rommens et al., Nucleic Acids Res. 11:5921-5940 (1983); Wang et al., Gene 242:105-114 (2000); and U.S. Pat. Nos. 4,495,280 and 4,689,406) and the xylose operator/repressor system has been shown to regulate expression in some Gram-positive bacteria. (See e.g., Dahl et al., J. Mol. Biol. 243:413-424 (1994); Sizemore et al., J. Bacteriol. 174:3042-3048 (1992); Wieland et al., Gene 158:91-96 (1995); Kreuzer et al., J. Bacteriol. 171:3840-3845 (1989); Lokinan et al., J. Bacteriol. 179:5391-5397 (1997); and Erbeznik et al., J. Bacteriol. 180:1103-1109 (1998)).

[0299] In a preferred embodiment, xylO is operably linked to a T5 promoter such that transcription from the fusion promoter is inducible by agents that inhibit the binding of the xylose repressor (encoded by xylR) to the xylose operator (xylO). The xylO can be the xylO from S. xylosis or B. subtilis or a number of other bacteria. This embodiment, referred to as the Xyl-T5 promoter, can also contain several other elements including, but not limited to, AT-rich boxes (AT Box) that enhance the efficiency of gene expression in Gram-positive bacteria. The use of an AT-Box-enchanced T5 promoter has been described by D. J. Henner in “Expression of Heterologous Genes in Bacillus subtilis” Methods Enzymol., 195:199 (1990), the disclosure of which is incorporated herein by reference in its entirety.

[0300] In some embodiments fusion promoters containing one or more AT Boxes are contemplated. The AT Box is positioned such that it increases the level of transcription from the fusion promoter in Gram-positive bacteria. For the Xyl-T5 embodiment, a consensus −45 AT Box is shown in FIG. 1 at position (1). Many variations of the AT Box can be used. For example, the AT Box can have more or less adenine or thymine nucleotides or different sequences or combinations of adenine and thymine molecules. In this respect, the diversity of the AT box is limited only by the ability of this AT rich region to facilitate transcription in Gram-positive bacteria. In other embodiments of the present invention, fusion promoters constructed from T5 promoter derivatives which have been modified to have altered activity in Gram-positive organisms are contemplated.

[0301] A preferred embodiment of the Xyl-T5 fusion promoter is shown in FIG. 1. In italics, at positions (2) and (4) of FIG. 1 are shown the −35 and −10 RNA polymerase recognition sites (−35 Box and −10 Box), respectively. The 17 nucleotide long element shown at position (3) is a site in which a second operator can be inserted if desired. The second operator is positioned such that the binding of a repressor to the second operator represses transcription from the promoter. In some embodiments, the second operator may be the xylO, tetO, malO, lacO, trpO, or λc1O. At position (5) is shown the xyl operator (xylO), which is placed just downstream of the −10 Box. The distance between xylO and the T5 promoter can be made shorter or longer provided the ability to sustain repression of the T5 promoter is maintained. In some embodiments, a multiple cloning sequence (designated at position (6)) is also inserted downstream of the xyl operator to allow nucleic acids to be transcribed from the promoter when operably linked to the promoter. Many different multiple cloning sequences can be inserted at position (6) and the multiple cloning sequence shown is only one example. Additionally, some embodiments can include a terminator sequence, to facilitate termination of the RNA transcribed from the promoter at a desired position (e.g., the first rrnB terminator shown at position (7A) and/or the second rrnB terminator shown at position (7B)). Preferably, the terminator sequence allows for Rho independent transcription termination.

[0302] The present invention also contemplates fusion promoters based on promoter sequences discovered in Lactococcus lactis linked to at least one operator. For example, the Lactococcus lactis promoters CP25 (Jensen and Hammer, Appl. Environ. Microbiol. 64:82-87 (1998), the disclosure of which is incorporated herein by reference in its entirety), P32 and P59 (Bruberg et al., Appl. Microbiol. Biotechnol. 42:108-115 (1994), the disclosure of which is incorporated herein by reference in its entirety), P1P2 (Nilsson and Johansen, Biochimica Biophysica Acta. 1219:141-144 (1994), the disclosure of which is incorporated herein by reference in its entirety), and PL (Madsen et al., Appl. Environ. Microbiol. 67:1128-1139, the disclosure of which is incorporated herein by reference in its entirety) can be fused to to at least one operator. In some embodiments of the present invention, fusion promoters constructed from derivatives of the above Lactococcus lactis sequences which have been modified to have altered activity in Gram-positive organisms are contemplated.

[0303] It will be appreciated that a regulatable fusion promoter suite can be constructed which provides a wide dynamic range of both basal and maximal transcriptional activities. Each fusion promoter within this suite can be tightly and finely regulated over its own characteristic range of transcriptional activity. Furthermore, the regulatable fusion promoters within the suite can comprise different promoters operably linked to the same operators thus providing for a common means of regulation of the promoters within the suite. In some embodiments, the promoters within the set of regulatable fusion promoters within the suite can be linked to at least one operator contained in a promoterless expression vector that is capable of replicating in one or more Gram-positive organisms. Accordingly, a regulatable fusion promoter having an appropriate basal level and/or maximal level of transcriptional activity can be selected from the suite for use in the appropriate application.

[0304] In some embodiments, fusion promoters based on promoter sequences discovered in Lactococcus actively promote transciption in Enterococcus faecalis and other Gram-positive organisms. One embodiment is the Xyl-CP25 fusion promoter shown in FIG. 2. Position (1) of FIG. 2 shows a portion of the xylR gene located upstream from the fusion promoter. Positions (2) and (4) show the −35 and −10 Boxes of the CP25 promoter sequence, respectively. The 17 nucleotide long element shown at position (3) is a site in which a second operator can be inserted if desired. The second operator is positioned such that the binding of a repressor to the second operator represses transcription from the promoter. In some embodiments, the second operator may be the xylO, tetO, malO, lacO, trpO, or λc1O operator. At position (5) is shown the xyl operator (xylO), which is placed just downstream of the −10 Box. The distance between xylO and the CP25 promoter can be made shorter or longer provided the ability to sustain repression of the CP25 promoter is maintained. The positions of unique RsrII and XhoI recognition sites flanking the promoter are indicated as (a) and (b), respectively. In some embodiments, the RsrII and XhoI sites can be used to exchange the CP25 promoter with other promoter sequences, such as promoters selected from SEQ ID NOs.: 36-45.

[0305] In other embodiments of the fusion promoters described herein, elements such as multiple cloning sites (MCS) and transcription terminators can be linked downstream. Preferably, the terminator sequence allows for Rho independent transcription termination. For example the rrnB terminator (See e.g., Orosz et al., Eur. J. Biochem., 201:653-659 (1991), the disclosure of which is incorporated by reference in its entirety) may be utilized.

[0306] Some fusion promoters of the present invention function in Staphylcoccus aureus and Enterococcus faecalis. Other fusion promoters described herein function in Gram-positive organisms such as Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species.

[0307] In another aspect of the invention, two or more operators are linked to a promoter. The operators are positioned such that binding of repressors to the operators represses transcription from the promoter. The operators may be identical or different. By controlling gene expression with multiple operators (e.g., using a plurality of operators that can be selectively modulated in specific organisms), genetic information can be safely transferred between bacterial species and expression can be selectively induced and regulated. For example, repressors that bind to one or more of the operators may be expressed in a first bacterial species or strain but not in a second species. Advantageously, some embodiments maintain tight repression of a potentially lethal transcript in a first species or strain that is used to amplify the construct but the repression can be efficiently removed in a second species or strain in which transcription from the promoter is desired.

[0308] One embodiment, for example, concerns a fusion promoter that selectively enables the repression of transcription in Escherichia coli and Staphylococcus species. The “Xyl/Lac-T5” promoter comprises the lacO operably linked to the Xyl-T5 promoter so that transcription of the genetic information associated with the promoter can be induced in Staphylococcus by adding xylose or an analog thereof and/or in Escherichia coli by adding lactose or an analog thereof (e.g., IPTG). This selective regulation may be a consequence of the fact that Staphylococcus does not express the lac repressor that binds to the lacO while Escherichia coli does not express the xylR that binds to the xylO. In this manner, the construct can be efficiently amplified in Escherichia coli and retain the ability to express the desired transcript in Staphylococcus once the binding of xylR to xylO is inhibited by adding an inducer such as xylose or an analog thereof.

[0309] Similar multioperator constructs, which incorporate both lacO and xylO so as to provide regulatable expression in both Escherichia coli and Gram-positive organisms, can be made using promoter sequences described herein, including promoters selected from SEQ ID NOs.: 26-35.

[0310] By following the approaches described in the examples below, with modifications as would be apparent to one of skill in the art, the preparation of a fusion promoter comprising operators other than those described in the examples can be accomplished without undue burden or extensive experimentation.

[0311] The present invention also contemplates fusion promoters that have been modified to have altered activity in Gram-positive organisms. Modifications to fusion promoters can be made, for example, by synthesizing the modified DNA sequences, PCR based mutagenesis, site-directed mutagenesis techniques, DNA shuffling or any other techniques for making site specific mutations.

[0312] In some embodiments of the present invention, nucleotides in the −10, −35 or −45 Boxes of a promoter are changed to make those sequence elements look more or less like the consensus −10, −35 or −45 Box sequences in the desired Gram-positive organisms. Such modification can have the affect of either increasing or decreasing the activity of the fusion promoter.

[0313] It will be appreciated that a regulatable fusion promoter suite can be constructed which provides a wide dynamic range of both basal and maximal transcriptional activities. It will also be appreciated that modifying a regulatable fusion promoter so as to increase or decrease its activity can be used to expand the range of both basal and maximal transcriptional activities of a suite of regulatable fusion promoters. Each modified fusion promoter within this suite can be tightly and finely regulated over its own characteristic range of transcriptional activity. Furthermore, the modified regulatable fusion promoters within the suite can comprise different promoters operably linked to the same operators thus providing for a common means of regulation of the promoters within the suite. In some embodiments, the promoter within the set of modified regulatable fusion promoters within the suite can be linked to at least one operator contained in a promoterless expression vector that is capable of replicating in one or more Gram-positive organisms. Accordingly, a regulatable fusion promoter having an appropriate basal level and/or maximal level of transcriptional activity can be selected from the suite for use in the appropriate application.

[0314] In other embodiments, promoters are modified by altering the length of the region between the operator or transcription start site and the −10 Box, altering the length and/or composition of the region between the −10 Box and the −35 Box, and altering the length and/or composition of the AT rich region upstream of the −35 Box (i.e. the −45 Box). It will also be appreciated that modifications which affect promoter activity can include addition of elements such as the CRE region for catabolite repression in Gram-positive bacteria. These additional elements can be further modified so that they are more or less like their corresponding consensus sequence.

[0315] In some embodiments, modified fusion promoters can function in one or more Gram-positive organisms. For example, some modified fusion promoters of the present invention function in Staphycoccus aueus and Enterococcus faecalis. Other modified fusion promoters described herein function in Gram-positive organisms such as Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species.

[0316] In some embodiments of the present invention, the promoters described herein are used to initiate transcription of an RNA (sense or anti-sense) in bacteria, desirably Gram-positive bacteria. In another embodiment, random polynucleotides (e.g., nucleic acids generated by oligonucleotide synthesis, shearing of bacterial genomic DNA or enzymatic digestion, such as digestion with restriction endonucleases, DNase I or other nucleases) are operably linked to a promoter described herein. These constructs are transferred to bacteria and can be used to transcribe RNA.

[0317] In one embodiment of the present invention, a fusion promoter comprising an operator, such as xylO, and a promoter, such one selected from SEQ ID NOs.: 36-45, is operably linked to a nucleic acid to be transcribed from the promoter, the construct is introduced into bacteria, a solution of xylose or an analog thereof is provided (e.g., a 2.0% xylose solution), and induction of transcription proceeds. Preferred nucleic acids that are operably linked to the fusion promoter are at least 6, at least 10, at least 15, at least 20, at least 100, at least 200, at least 300, at least 400, at least 500, at least 800 or more than 800 nucleotides in length and, in some embodiments, the nucleic acids that are operably linked to the fusion promoter encode a transcript that is complementary to at least 6, at least 10, at least 15, at least 20, at least 100, at least 200, at least 300, at least 400, at least 500, at least 800 or more than 800 nucleotides within the genome of a Gram-positive organism, such as Staphylococcus or Enterococcus, which code for a gene product such as a protein or RNA. That is, nucleic acids that are operably linked to the fusion promoter can be in the “sense” or “antisense” orientation and can be at least 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800 or more than 800 nucleotides in length.

[0318] In another embodiment, the promoters described herein are used in an expression construct to initiate the production of a polypeptide or protein in bacteria, desirably Gram-positive bacteria. In some aspects of this embodiment, the peptide or protein generated corresponds to an expression product encoded by a full-length gene, a portion of a full-length gene, or a random polynucleotide or fragment of DNA generated by oligonucleotide synthesis or enzymatic digestion of genomic DNA (e.g., bacteria, yeast, or mold).

[0319] In one embodiment of the present invention, a fusion promoter comprising an operator, such as xylO, and a promoter, such one selected from SEQ ID NOs.: 36-45, is operably linked to a nucleic acid to be transcribed from the promoter, the construct is introduced into bacteria, a solution of xylose or an analog thereof is provided (e.g., a 2.0% xylose solution), and induction of transcription proceeds. This expression method can be performed in Gram-positive or Gram-negative bacteria and are desirably performed in Staphylococcus, Enterococcus or other Gram-positive species.

[0320] The DNA encoding the desired molecule can be generated by oligonucleotide synthesis or enzymatic cleavage of DNA (e.g., random fragments of bacterial genomic DNA including, but not limited to, Staphylococcus) The peptides expressed by using a fusion promoter can be polypeptides, proteins, and fusion proteins. The peptides can be, for example, 3-20 amino acids in length, 20-100 amino acids in length, or 100, 200, 300, 400, 500, 600, 700, 800, or more amino acids in length. That is, the peptides can be at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800 or more than 800 amino acids in length.

[0321] Regulatable promoters providing a range of transcriptional levels in both the repressed and induced states, such as the promoters described herein, are desirable in applications where production of defined levels of transcript or protein is necessary. For example, promoters which can be used to tightly and/or finely regulate transcription at a desired level are desirable for use in cell-based assays which employ sensitized cells for screening potential antibiotic compounds (described in Examples 19-25). Specifically, it is desirable to use regulatable fusion promoters, such as those described herein, to modulate the sensitivity of cells to test compounds by altering the amount of a proliferation-inhibiting antisense nucleic acid transcribed by the promoter. Because the results of such cell-based assays are affected by the degree to which the cells are sensitized, the efficacy of these assays can be increased by using promoters having the ability to finely regulate the level of proliferation-inhibiting transcript over a wide range of transcript concentrations.

[0322] The present invention contemplates fusion promoters which can be used to achieve the desired level of nucleic acid transcription or protein expression desired for applications such as the cell-based gene and drug discovery methods described herein. The appropriate fusion promoter can be selected by first determining the approximate level of transcript production that is required for the assay. For example, if only a low level of transcription is necessary, a promoter with a low maximum transcription efficiency may be used. If a tightly controlable transcription over a wide range is required, a fusion promoter having nonleaky repression and a high maximum transcription efficiency may used. The amount of actual transcription obtained from the fusion promoter can then be fine tuned by determining the concentration of inducer which achieves the desired effect. Inducer can be introduced to the cell cultures as a solid or a liquid. The inducer can also be an appropriate growth condition, including but not limited to, the absense of a regulatory molecule, a change in temperature or exposure to radiation. The inducer can comprise from 0% to greater than 25% of the concentration of the growth medium or can be an appropriate growth condition. In some enbodiments, the inducer comprises 0%, 0.1%, 0.2%, 0.3%, 0.4%, 0.5%, 0.6%, 0.7%, 0.8%, 0.9%, 1.0%, 1.5%, 2.0%, 3.0%, 4.0%, 5.0%, 6.0%, 7.0%, 8.0%, 9.0%, 10.0%, 12.0%, 15%, 17%, 20%, 25%, or greater than 25% of the concentration of the growth medium.

[0323] The present invention includes vectors containing fusion promoters operably linked to a reporter gene. Such expression vectors can be used to quantitate the transcriptional activity of the fusion promoter contained therein. In some embodiments, a promoter can be inserted into the reporter vector as a cassette so as to form a fusion promoter construct with an operator located on the vector upstream of the reporter gene. In a prefered embodiment, the β-galactosidase encoding genes, lacL-lacM, from Leuconostoc mesenteroides, (Israelsen et al., Appl. Environ. Microbiol. 61:2540-2547 (1995); Pedersen et al., Mol. Gen. Genet. 244:374-382 (1994), the disclosures of which are incorporated by reference in their entireties), are linked to a fusion promoter described herein. The present invention also contemplates methods for using the lacL-lacM genes as a reporter in Enterococcus faecalis.

[0324] Expression vectors containing fusion promoters linked upstream of a multiple cloning site (MCS) and/or a transcription terminator are also contemplated by the present invention. In a preferred embodiment the transcription terminator is a rho-independent terminator such as rrnBt1t2. In some embodiments of the present invention, a promoter sequence can be inserted into the expression vector as a cassette so as to form a fusion promoter construct with an operator located on the vector upstream of the MCS. In other embodiments, a promoter linked to an operator can be inserted into the expression vector as a cassette just upstream of the MCS.

[0325] Expression vectors containing fusion promoters can be used to produce transcripts, such as antisense RNA, or for the expression of proteins. In some embodiments, nucleic acids corresponding to proliferation-required genes or fragments thereof or random genomic fragments are inserted into the MCS in a sense or antisense orientation. Upon induction of the fusion promoter with an inducer, antisense or sense transcripts are produced. Expression of such transcripts from the fusion promoter can be tightly and/or finely regulated.

[0326] In a preferred embodiment of the present invention, reporter and expression vectors contain both an origin of replication that is functional in Escherichia coli and/or other Gram-negative organisms and an origin of replication that function in one or more Gram-positive organisms such as Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species. In some embodiments, an origin of replication is functional in Staphylococcus aureus and Enterococcus faecalis.

[0327] In some embodiments, nucleic acids that are operably linked to the fusion promoter encode a molecule (e.g., a sense RNA, anti-sense RNA, aptamer, or peptide) that inhibits proliferation of Gram-positive bacteria including, but not limited to, Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species. Certain embodiments include methods of identifying genes that are essential to bacterial proliferation (“essential genes”) whereby the constructs described above are selectively induced in bacteria, such as pathogenic bacteria, at various levels and the impact on cell proliferation is monitored. For example, the fusion promoters of the present invention may be utilized in any of the methods described in U.S. patent application Ser. No. 09/492,709, U.S. patent application Ser. No. 09/711,164, U.S. patent application Ser. No. 09/741,699, and U.S. patent application Ser. No. 09/815,242, the disclosures of which are incorporated by reference herein in their entireties.

[0328] One such method is practiced by introducing a candidate nucleic acid into a microorganism, wherein the candidate nucleic acid is operably linked to a fusion promoter comprising a xyl, tet, trp, mal, lac or λc1 operator and one of the promoter sequences described herein. The fusion promoter may comprise a promoter that has been modified to to alter its transcriptional activity in one or more Gram-positive bacteria including but not limited to Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species. In some embodiments, the candidate nucleic acid is generated by random oligonucleotide synthesis and in other embodiments, the candidate nucleic acid is generated by cleavage of genomic DNA from an organism (e.g., bacteria, viruses, fungi, mold, parasites, yeast, plants and animals). For example, the candidate nucleic acid may be a random fragment generated by methods such as DNaseI digestion, restriction endonuclease digestion or shearing. In some cases, the candidate nucleic acid will be made so that an antisense RNA, sense RNA, aptamer, peptide, or protein will be expressed upon induction of the fusion promoter. In some cases, the candidate nucleic acid will encode a transcript that corresponds to or is complementary to a gene present in the genome of an organism (e.g., bacteria, viruses, fungi, mold, parasites, yeast, plants and animals). In other cases, the candidate nucleic acid will encode a peptide that corresponds to or interacts with a protein in the bacteria.

[0329] In the embodiments in which an aptamer is transcribed upon induction of the fusion promoter, the aptamer may be a nucleic acid which directly binds to the target molecules. Alternatively, the aptamer may encode a peptide that binds to the target molecule. For example, the aptamer may encode a peptide of at least 4, 6, 8, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70 or more than 70 amino acids in length. Aptamers which encode peptides have been described in PCT publication WO 99/50462, the disclosure of which is incorporated herein by reference in its entirety.

[0330] For example, RNA aptamers corresponding to regions of Tat and Rev RNA were introduced into or expressed in a cell and were shown to be specific RNA ligands for proteins that can specifically inhibit their translation. (Good et al., 1997, Gene Therapy 4: 45-54). Similarly, Conrad et al (J. Biol. Chem. 269: 32051, 1994) have shown that certain RNA aptamers can inhibit the βII isoenzyme of PKC. These RNA aptamers were selected from a pool of RNA molecules that contained a 120-nucleotide randomized region.

[0331] In general, the aspects of the invention that employ aptamer technology concern the use of the fusion promoters described herein to drive expression of aptamers of varying lengths and compositions. A minimum of approximately 6 nucleotides, preferably 10, and more preferably 14 or 15 nucleotides, may be necessary to effect specific binding to some target molecules. Thus, in some embodiments, fusion promoters of the invention are linked to oligonucleotides encoding aptamers that are 6, 10, 14, or 15 nucleotides in length. The only apparent limitations on the binding specificity of the target/aptamer complex concern sufficient sequence to be distinctive in the binding aptamer and sufficient binding capacity of the target molecule to obtain the necessary interaction. Aptamers of binding regions containing sequences shorter than 10, e.g., 6-mers, are feasible if the appropriate interaction can be obtained in the context of the environment in which the target is placed. Thus, if there are few interferences by other materials, less specificity and less strength of binding may be required.

[0332] Several methods for selecting aptamer oligonucleotides directed against a specific protein are known in the art and these approaches can be employed to design constructs having the fusion promoters linked to an oligonucleotide that encodes an aptamer. Examples of these methods include those methods described by Mills et al. Proc. Natl. Acad. Sci. USA 58:224 (1967), Green et al. Nature 347:406 (1990), and Tuerk and Gold Science 249:505 (1990), (all references are expressly incorporated by reference in their entireties), which show the use of in vitro evolution techniques to select RNA molecules with desired properties. Ellington and Szostak Nature 346:818 (1990), and Bock et al. Nature 355:564 (1992) (all references are expressly incorporated by reference in their entireties) also describe techniques useful for the selection of DNA oligonucleotides. More teachings on the design and selection of apatamers can be found in U.S. Pat. Nos. 5,756,291; 5,874,567; 5,874,281; 5,998,596; and 5,965,352 (all references are expressly incorporated by reference in their entireties). The field has also been recently reviewed by Burke and Berzal-Herranz FASEB J 7:106 (1993), herein expressly incorporated by reference in its entirety.

[0333] Once the construct comprising the candidate nucleic acid operably linked to a promoter of the invention is made, it is transferred to host bacteria. Next, the proliferation of microorganisms transcribing a higher level of the candidate nucleic acid is compared to the proliferation of microorganisms that transcribe a lower level of the candidate nucleic acid or that do not transcribe the candidate nucleic acid. These varying levels of transcription can be achieved by varying the amounts of inducer (e.g., xylose, tetracycline, maltose, the absense of tryptophan, temperature, exposure to ultraviolet radiation, or analogs thereof). The level of maximal induction that can be achieved will depend on the fusion promoter selected. A difference in cell proliferation in the sample transcribing a higher level of the candidate nucleic acid compared to the sample expressing a lower level of the candidate nucleic acid indicates that the candidate nucleic acid encodes a nucleic acid that is complementary to an essential gene or encodes a peptide or protein that reduces proliferation of the microorganism.

[0334] In some embodiments, the RNA, aptamer, peptide or protein reduces proliferation of the microorganisms by interacting with another molecule required for proliferation. In this case, many techniques in molecular biology can be employed to identify the cell proliferation molecule that interacts with the aptamer, peptide, or protein and the gene encoding this molecule. Conventional one and two hybrid systems, for example, can be readily adapted to identify molecules that bind to an aptamer, peptide, or protein described above, for example. Such approaches include:

[0335] (1) the two-hybrid systems (Field & Song, Nature 340:245-246 (1989); Chien et al., Proc. Natl. Acad. Sci. USA 88:9578-9582 (1991); and Young K H, Biol. Reprod. 58:302-311 (1998), all of which are expressly incorporated by reference in their entirety);

[0336] (2) the reverse two-hybrid system (Leanna & Hannink, Nucl. Acid Res. 24:3341-3347 (1996), herein incorporated by reference in their entirety);

[0337] (3) the repressed transactivator system (Sadowski et al., U.S. Pat. No. 5,885,779), herein incorporated by reference in their entirety);

[0338] (4) phage display systems (Lowman H B, Annu. Rev. Biophys. Biomol. Struct. 26:401-424 (1997), herein incorporated by reference in their entirety); and

[0339] (5) GST/HIS pull down assays, mutant operators (Granger et al., WO 98/01879) and the like (See also Mathis G., Clin. Chem. 41:139-147 (1995); Lam K. S. Anticancer Drug Res., 12:145-167 (1997); and Phizicky et al., Microbiol Rev. 59:94-123 (1995), all of which are expressly incorporated by reference in their entirety).

[0340] In one embodiment, the present invention utilizes an antisense-based method to identify proliferation-required sequences. Generally, a library of nucleic acids from a given source are subcloned or otherwise inserted immediately downstream of an inducible fusion promoter on an appropriate vector, such as one of the expression vectors described herein, thus forming an expression library. It is generally preferred that expression is directed by a regulatable fusion promoter such that expression level can be adjusted by addition of variable concentrations of an inducer molecule or of an inhibitor molecule to the medium. Temperature activated fusion promoters, such as promoters regulated by temperature sensitive repressors, such as the lambda C₁₈₅₇ repressor, are also envisioned. Although the insert nucleic acids may be derived from the chromosome of the cell or microorganism into which the expression vector is to be introduced, because the insert is not in its natural chromosomal location, the insert nucleic acid is an exogenous nucleic acid for the purposes of the discussion herein.

[0341] Once generated, the expression library containing the nucleic acids is introduced into a population of cells (such as the organism from which the nucleic acid sequences were obtained or an organism other than the organism from which the sequences were obtained) to search for genes that are required for bacterial proliferation. Because the library molecules are foreign, in context, to the population of cells, the expression vectors and the nucleic acid segments contained therein are considered exogenous nucleic acid.

[0342] Expression of the nucleic acid fragments in the test population of cells containing the expression library is then activated. Activation of the expression vectors consists of subjecting the cells containing the vectors to conditions that result in the expression of the nucleic acids carried by the expression library. The test population of cells is then assayed to determine the effect of expressing the nucleic acid fragments on the test population of cells. Those expression vectors that negatively impact the growth of the cells upon induction of expression of the random sequences contained therein are identified, isolated, and purified for further study.

[0343] A variety of assays are contemplated to identify nucleic acid sequences that negatively impact growth upon expression. In one embodiment, growth in cultures expressing nucleic acid sequences and growth in cultures not expressing these sequences is compared. Growth measurements are assayed by examining the extent of growth by measuring optical densities. Alternatively, enzymatic assays can be used to measure bacterial growth rates to identify nucleic acid sequences of interest. Colony size, colony morphology, and cell morphology are additional factors used to evaluate growth of the host cells. Those cultures that fail to grow or grow at a reduced rate under expression conditions are identified as containing an expression vector encoding a nucleic acid fragment that negatively affects a proliferation-required gene.

[0344] In some embodiments, genomic fragments of an organism are operably linked to an inducible fusion promoter in a vector and assayed for growth inhibition activity. Examples below describe the examination of libraries of genomic fragments cloned into vectors comprising inducible fusion promoters. Vectors containing fusion promoters, such as those described herein, can be used. Upon transformation of these vectors into a host organism and induction, the vectors produce an RNA molecule corresponding to the subcloned genomic fragments. In those instances where a genomic fragment is in an antisense orientation with respect to the fusion promoter, the antisense transcript produced can be complementary to at least a portion of a sense messenger RNA (mRNA) encoding a proliferation-required gene product such that the antisense transcript interacts with at least a portion of the proliferation-required sense mRNA produced by the host organism thereby decreasing the translation efficiency or the level of the proliferation-required sense mRNA and thus decreasing production of the proliferation-required protein encoded by the sense mRNA molecule. In cases where the sense mRNA encodes a protein required for proliferation, the host organism containing a vector from which transcription from the fusion promoter has been induced fails to grow or grows at a substantially reduced rate. Additionally, in cases where the transcript produced is complementary to at least a portion of a non-translated RNA and where that non-translated RNA is required for proliferation, host cells containing a vector from which transcription from the fusion promoter has been induced also fail to grow or grow at a substantially reduced rate.

[0345] In cases where the candidate nucleic acid encodes an antisense nucleic acid comprising a nucleotide sequence that is complementary to a gene required for proliferation or a portion thereof, the gene sequence of the coding fragment of the proliferation-required gene or a portion thereof can be determined by direct sequencing.

[0346] Determination of the gene(s) corresponding to the nucleotide sequence can achieved by comparing the obtained sequence data with databases containing known protein and nucleotide sequences from various microorganisms. Thus, initial gene identification can made on the basis of significant sequence similarity or identity to either characterized or predicted from Staphylococcus aureus and Enterococcus faecalis genes or their encoded proteins and/or homologues in other species.

[0347] The number of nucleotide and protein sequences available in database systems has been growing exponentially for years. For example, the complete nucleotide sequences of Caenorhabditis elegans and several bacterial genomes, including E. coli, Aeropyrum pernix, Aquifex aeolicus, Archaeoglobus fulgidus, Bacillus subtilis, Borrelia burgdorferi, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium tetani, Corynebacterium diptheria, Deinococcus radiodurans, Haemophilus influenzae, Helicobacter pylori 26695, Helicobacter pylori J99, Methanobacterium thermoautotrophicum, Methanococcus jannaschii, Mycobacterium tuberculosis, Mycoplasma genitalium, Mycoplasma pneumoniae, Pseudomonas aeruginosa, Pyrococcus abyssi, Pyrococcus horikoshii, Rickettsia prowazekii, Synechocystis PCC6803, Thermotoga maritima, Treponema pallidum, Bordetella pertussis, Campylobacter jejuni, Clostridium acetobutylicum, Mycobacterium tuberculosis CSU#93, Neisseria gonorrhoeae, Neisseria meningitidis, Pseudomonas aeruginosa, Pyrobaculum aerophilum, Pyrococcus furiosus, Rhodobacter capsulatus, Salmonella typhimurium, Streptococcus mutans, Streptococcus pyogenes, Ureaplasma urealyticum and Vibrio cholera are available. This nucleotide sequence information is stored in a number of databanks, such as GenBank, the National Center for Biotechnology Information (NCBI), the Genome Sequencing Center (http://genome.wustl.edu/gsc/salmonella.shtml), and the Sanger Centre (http://www.sanger.ac.uk/projects/S_typhi) which are publicly available for searching.

[0348] A variety of computer programs are available to assist in the analysis of the sequences stored within these databases. FASTA, (W. R. Pearson (1990) “Rapid and Sensitive Sequence Comparison with FASTP and FASTA” Methods in Enzymology 183:63-98), Sequence Retrieval System (SRS), (Etzold & Argos, SRS an indexing and retrieval tool for flat file data libraries. Comput. Appl. Biosci. 9:49-57, 1993) are two examples of computer programs that can be used to analyze sequences of interest. In one embodiment of the present invention, the BLAST family of computer programs, which includes BLASTN version 2.0 with the default parameters, or BLASTX version 2.0 with the default parameters, is used to analyze nucleotide sequences.

[0349] BLAST, an acronym for “Basic Local Alignment Search Tool,” is a family of programs for database similarity searching. The BLAST family of programs includes: BLASTN, a nucleotide sequence database searching program, BLASTX, a protein database searching program where the input is a nucleic acid sequence; and BLASTP, a protein database searching program. BLAST programs embody a fast algorithm for sequence matching, rigorous statistical methods for judging the significance of matches, and various options for tailoring the program for special situations. Assistance in using the program can be obtained by e-mail at blast@ncbi.nlm.nih.gov. tBLASTX can be used to translate a nucleotide sequence in all three potential reading frames into an amino acid sequence.

[0350] Bacterial genes are often transcribed in polycistronic groups. These groups comprise operons, which are a collection of genes and intergenic sequences under common regulation. The genes of an operon are transcribed on the same mRNA and are often related functionally. Given the nature of the screening protocol, it is possible that the identified exogenous nucleic acid corresponds to a gene or portion thereof with or without adjacent noncoding sequences, an intragenic sequence (i.e. a sequence within a gene), an intergenic sequence (i.e. a sequence between genes), a nucleotide sequence spanning at least a portion of two or more genes, a 5′ noncoding region or a 3′ noncoding region located upstream or downstream from the actual nucleotide sequence that is required for bacterial proliferation. Accordingly, it is often desirable to determine which gene(s) that is encoded within the operon is individually required for proliferation.

[0351] In one embodiment of the present invention, an operon is identified and then dissected to determine which gene or genes are required for proliferation. Operons can be identified by a variety of means known to those in the art. For example, the RegulonDB DataBase described by Huerta et al. (Nucl. Acids Res. 26:55-59, 1998), which may also be found on the website http://www.cifn.unam.mx/Computational_Biology/regulondb/, the disclosures of which are incorporated herein by reference in their entireties, provides information about operons in Escherichia coli. The Subtilist database (http://bioweb.pasteur.fr/GenoList/SubtiList), (Moszer, I., Glaser, P. and Danchin, A. (1995) Microbiology 141: 261-268 and Moszer, 1 (1998) FEBS Letters 430: 28-36, the disclosures of which are incorporated herein in their entireties), may also be used to predict operons. This database lists genes from the fully sequenced, Gram-positive bacteria, Bacillus subtilis, together with predicted promoters and terminator sites. This information can be used in conjunction with the Staphylococcus aureus genomic sequence data to predict operons and thus produce a list of the genes affected by the antisense nucleic acids of the present invention. The TIGR microbial database has an incomplete version of the Enterococcus faecalis genome http://www.tigr.org/cgi-bin/BlastSearch/blast.cgi?organism=e_faecalis. One can take a nucleotide sequence and BLAST it for homologs.

[0352] A number of techniques that are well known in the art can be used to dissect the operon. Analysis of RNA transcripts by Northern blot or primer extension techniques are commonly used to analyze operon transcripts. In one aspect of this embodiment, gene disruption by homologous recombination is used to individually inactivate the genes of an operon that is thought to contain a gene required for proliferation.

[0353] Several gene disruption techniques have been described for the replacement of a functional gene with a mutated, non-functional (null) allele. These techniques generally involve the use of homologous recombination. One technique using homologous recombination in Staphylococcus aureus is described in Xia et a. 1999, Plasmid 42: 144-149, the disclosure of which is incorporated herein by reference in its entirety. This technique uses crossover PCR to create a null allele with an in-frame deletion of the coding region of a target gene. The null allele is constructed in such a way that nucleotide sequences adjacent to the wild type gene are retained. These homologous sequences surrounding the deletion null allele provide targets for homologous recombination so that the wild type gene on the Staphylococcus aureus chromosome can be replaced by the constructed null allele. Enterococcus faecalis genes can be disrupted by recombining in a non-replicating plasmid that contains an internal fragment to that gene (Leboeuf, C., L. Leblanc, Y. Auffray and A. Hartke. 2000. J. Bacteriol. 182:5799-5806, the disclosure of which is incorporated herein by reference in its entirety).

[0354] The crossover PCR amplification product is subcloned into a suitable vector having a selectable marker, such as a drug resistance marker. In some embodiments the vector may have an origin of replication which is functional in Escherichia coli or another organism distinct from the organism in which homologous recombination is to occur, allowing the plasmid to be grown in Escherichia coli or the organism other than that in which homologous recombination is to occur, but may lack an origin of replication functional in Staphylococcus aureus or Enterococcus faecalis such that selection of the selectable marker requires integration of the vector into the homologous region of the Staphylococcus aureus or Enterococcus faecalis chromosome. Usually a single crossover event is responsible for this integration event such that the Staphylococcus aureus or Enterococcus faecalis chromosome now contains a tandem duplication of the target gene consisting of one wild type allele and one deletion null allele separated by vector sequence. Subsequent resolution of the duplication results in both removal of the vector sequence and either restoration of the wild type gene or replacement by the in-frame deletion. The latter outcome will not occur if the gene should prove essential. A more detailed description of this method is provided below. It will be appreciated that this method may be practiced with any genes identified as required for proliferation in any of the organisms described herein.

[0355] Bacteria that harbor the expression vectors described herein can be induced to transcribe a desired level of RNA or to express a desired level of peptide, which affects a gene that is essential for the viability and/or growth of the cell and renders the cells more sensitive to compounds that reduce cellular proliferation. This technique is referred to as Essential Gene Interference (EGI). In one embodiment, for example, an RNA, such as an antisense RNA, or peptide generated from the constructs described above interacts with an essential gene of a bacteria, RNA, or product thereof and when the level of transcript or peptide is increased in the cell, viability and/or cell growth is compromised. Thus, the genes that are affected by the transcript or peptide either directly or indirectly (e.g., by interacting with the RNA or protein that correspond to the gene) causing a reduction in viability and/or cell growth are identified as essential genes. These essential genes are then used as drug discovery targets for high throughput drug screening.

[0356] The promoters described herein can also be used to replace endogenous promoters so as to create cells having specific regulatable genes. By this approach, an essential gene is identified (e.g., using the techniques described herein) and the endogenous promoter is replaced with a regulatable promoter of the invention (e.g, Xyl-T5, Xyl-T5-DD, Xyl-CP25, Xyl-P32, Xyl-P59, Xyl-P59*, Xyl-P1P2, Xyl-P1P2*, Xyl, PL, and XylPL10) using ET cloning technology. This technique is referred to as Rapid Replacement of Genomic Promoters (promoter replacement). This method employs the Polymerase Chain Reaction (PCR) to replace the promoter of the target gene without interrupting the endogenous Shine-Dalgamo sequence. As little as 60 bp homology to the target promoter can be used. The correct integration of the cassette is confirmed by colony PCR and correct integrants are tested for growth defects when an inducer is titrated away. The inability to grow in the absence of inducer confirms the essentiality of the target gene. Additionally, the bacterial strains created using this approach can be used in high throughput drug screening methodologies.

[0357] A similar technique to promoter replacement can be used to insert an operator into a genomic promoter region so as to make the genomic promoter regulatable. This technique is referred to as operator insertion. As little as 60 bp homology to the target promoter region can be used. The correct integration of the cassette is confirmed by colony PCR and correct integrants are tested for growth defects when an inducer is titrated away. The inability to grow in the absence of inducer confirms the essentiality of the target gene. Additionally, the bacterial strains created using this approach can be used in high throughput drug screening methodologies.

[0358] In some embodiments, promoter replacement and operator insertion methods are conducted in cells which have an increased frequency of homologous recombination. For example, the organism may lack or have a reduced level or activity of one or more exonucleases which would ordinarily degrade the DNA to be inserted into the chromosome, thereby effectively increasing the stability of the linear DNA in the organism.

[0359] In still further embodiments, the strains may both lack or have reduced levels of exonucleases and constitutively or conditionally express proteins involved in mediating homologous recombination. For example, Gram-positive bacterial strains in which the activity of an exonuclease which degrades linear nucleic acids, such as exonuclease V of the RecBCD recombination pathway or an exonuclease with an analogous function, has been reduced or eliminated, may be used. Such strains include those having mutations in the recB, recC, or recD genes or genes that express proteins having analogous functions, that enhance the frequency of homologous recombination. In some embodiments, the strains have mutations in more than one of the recB, recC, or recD genes or genes that express proteins having analogous functions, that enhance the frequency of homologous recombination. For example the strains may have mutations in both the recB and recC genes or genes that express proteins having analogous functions.

[0360] It will be appreciated that Gram-positive organisms having reduced or eliminated levels of one or more proteins analogous in function to RecB, RecC, or RecD may be used with the methods described herein. In some embodiments, Gram-positive organisms having reduced or eliminated levels of one or more proteins analogous in function to RecB, RecC or RecD may be selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species. In some embodiments, it may be desirable to use Escherichia coli having reduced or eliminated levels of one or more of RecB, RecC or RecD proteins as a surrogate host in the methods described herein.

[0361] The methods may also be performed in Gram-positive bacterial strains in which the activity of a the RecET recombinase system of the Rac prophage, or a recombinase system having an analogous function, has been activated. Such strains are analogous to Escherichia coli strains which carry an sbcA mutation. The RecE gene of the Rac prophage encodes ExoVIII a 5′-3′ exonuclease, while the RecT gene of the Rac prophage encodes a single stranded DNA binding protein which facilitates renaturation and D-loop formation. Thus, the gene products of the RecE and RecT genes or proteins with analogous functions facilitate homologous recombination. The RecE and RecT genes are on the same operon and are not normally expressed. Similarly, genes having a function analogous to the RecE and RecT may not be expressed. However, strains having a mutation analogous to sbcA would activate the expression of RecE and RecT or genes having analogous function. In some embodiments, the methods may be performed in strains which carry mutations in the recB and recC genes or genes that express proteins having analogous functions, as well as the mutation in the gene analogous in function to sbcA.

[0362] In some embodiments, the methods may be performed in Gram-positive bacterial strains in which recombination via the RecF pathway or a pathway having an analogous function has been enhanced. One example of such strains are those having a mutation analogous in function to the Escherichia coli sbcB mutation.

[0363] It will be appreciated that the recE and recT gene products, or proteins with analagous functions may be constitutively or conditionally expressed in Gram-positive organisms. In some embodiments, these proteins may be conditionally or constitutively expressed in Gram-positive organisms such as Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridiurn teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species. In some embodiments, Gram-positive strains may be genetically engineered to express the RecE and recT genes of the Rac prophage. For example, plasmids encoding these gene products may be introduced into the organism. If desired, the coding sequences encoding these gene products may be optimized to reflect the codon preferences of the organism in which they are to be expressed. In some embodiments, it may be desirable to use Escherichia coli constitutively or conditionally expressing the RecE and RecT gene products as a surrogate host in the methods described herein.

[0364] In further embodiments, the methods may be conducted in cells which utilize the Red system of bacteriophage lambda (λ) or analogous systems from other phages to enhance the frequency of homologous recombination. The Red system contains three genes, γ, β and exo whose products are the Gam, Bet and Exo proteins (see Ellis et al. PNAS 98:6742-6746, 2001, the disclosure of which is incorporated herein by reference in its entirety). The Gam protein inhibits the RecBCD exonuclease V, thus permitting Bet and Exo to gain access to the ends of the DNA to be integrated and facilitating homologous recombination. The Bet protein is a single stranded DNA binding protein. The Exo protein is a double-stranded DNA dependent 5′-3′ exonuclease. Thus, expression of the λ red proteins or proteins having analogous functions facilitates homologous recombination.

[0365] It will be appreciated that the λ Bet, Gam and Exo proteins, or proteins with analagous functions may be constitutively or conditionally expressed in Gram-positive organisms. In some embodiments, these proteins may be conditionally or constitutively expressed in Gram-positive organisms such as Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species. For example, plasmids encoding these gene products may be introduced into the organism. If desired, the coding sequences encoding these gene products may be optimized to reflect the codon preferences of the organism in which they are to be expressed. In some embodiments, it may be desirable to use Escherichia coli constitutively or conditionally expressing the λ Bet, Gam and Exo gene products as a surrogate host in the methods described herein.

[0366] In some embodiments, the cells may have an increased frequency of homologous recombination as a result of more than one of the aforementioned characteristics. In some embodiments, the enhanced frequency of recombination may be a conditional characteristic of the cells which depends on the culture conditions in which the cells are grown. For example, in some embodiments, expression of the λ Red Gam, Exo, and Bet proteins or recE and recT proteins may be constitutive or regulated. Thus, the cells may have an increased frequency of homologous recombination as a result of any combination of the aforementioned characteristics. For example, in some embodiments, the cell may carry the sbcA and recBC mutations or analogous mutations.

[0367] In some embodiments, a linear double stranded DNA containing the regulatable fusion promoter, operator or other regulatory element to be inserted into the chromosome of the prokaryotic organism is introduced into an organism constitutively or conditionally expressing the recE and recT or the λ Bet, Gam and Exo proteins or proteins with analogous functions as described above. In some embodiments, the double stranded DNA may be introduced into an organism having the recBC and sbcA mutations or analogous mutations.

[0368] In other embodiments, a single stranded DNA containing the regulatable fusion promoter, operator or other regulatory element to be inserted into the chromosome of the prokaryotic organism is introduced into an organism constitutively or conditionally expressing the λ Bet protein or a protein with an analogous function. In some embodiments the single stranded DNA is introduced into an organism constitutively or conditionally expressing both the k Bet and Gam proteins or proteins with analogous functions. In further embodiments, the single stranded DNA is introduced into an organism constitutively or conditionally expressing the λ Bet, Gam and Exo proteins or proteins with analogous functions.

[0369] It will be appreciated that any method of enhancing the frequency of homologous recombination may be used in conjunction with the present invention and that the methods of the present invention are not limited to use in cells having the aforementioned characteristics.

[0370] The 5′ and 3′ regions of the linear nucleic acid to be integrated via homologous recombination may have any length of homology to the target sequence in the chromosome of the organism which permits homologous recombination to occur. In some embodiments, the nucleic acid to be integrated via homologous recombination is generated by PCR. In such embodiments, it is desirable for the PCR amplicon to be relatively short. Accordingly, in some embodiments, the 5′ and 3′ regions of the amplicon which have homology to the target sequence in the chromosome of the organism, may each be more than 120 nucleotides in length, at least 120 nucleotides in length, at least 100 nucleotides in length, at least 90 nucleotides in length, at least 80 nucleotides in length, at least 70 nucleotides in length, at least 60 nucleotides in length, at least 50 nucleotides in length, at least 40 nucleotides in length, at least 30 nucleotides in length or less than 30 nucleotides in length.

[0371] As discussed above, the linear double stranded DNA or linear single stranded DNA may be introduced into a suitable strain having an enhanced frequency of recombination.

[0372] In one embodiment of the present invention, genes required for proliferation are identified by methods in which the native promoter of a gene is replaced with a regulatable fusion promoter using homologous recombination. In such methods, a linear promoter replacement construct which comprises a 5′ portion homologous to a first portion of the native promoter, a 3′ portion homologous to a second portion of the native promoter and a regulatable fusion promoter disposed between the 5′ portion and the 3′ portion is introduced into a cell containing the target gene. After replacement of the native promoter in the target gene with the regulatable fusion promoter via homologous recombination, the extent of proliferation of cells in which the regulatable fusion promoter has replaced the native promoter under conditions in which the regulatable fusion promoter is active at a first level is compared to the extent of proliferation of the cells under conditions in which the regulatable fusion promoter is active at a second level which is lower than the first level. If the candidate gene is required for proliferation, the extent of proliferation of the cells grown under conditions where the fusion promoter is active at the higher level will be greater than the extent of proliferation of cells grown under conditions where the fusion promoter is active at the lower level.

[0373] The regulatable fusion promoter in the promoter replacement construct may be any of the regulatable fusion promoters described herein. For example, in some embodiments, the regulatable fusion promoter may be selected from the group consisting of Xyl-T5, Xyl-T5-DD, Xyl-CP25, Xyl-P32, Xyl-P59, Xyl-P59*, Xyl-P1P2, Xyl-P1P2*, Xyl-PL, and Xyl-PL10.

[0374] In some embodiments, the linear promoter replacement construct may further comprise a fusion promoter operably linked to a nucleic acid which encodes an identifiable marker or a selectable marker. The promoter which is operably linked to a nucleic acid encoding an identifiable marker or a selectable marker and the nucleic acid which encodes the identifiable marker or selectable marker may be 5′ of the regulatable fusion promoter. The identifiable marker may be any protein whose expression is readily detectable. For example, the identifiable marker may be β-galactosidase, or any of the markers familiar to those skilled in the art which allow identification of cells expressing the marker. Likewise, the selectable marker may be any protein which allows selection of cells which express the protein. For example, the selectable marker may confer resistance to an antibiotic or permit growth under a particular set of culture conditions. A variety of selectable markers are familiar to those skilled in the art, including genes which confer resistance to antibiotics such as chloramphenicol, ampicillin, kanamycin and tetracycline.

[0375] In some embodiments, a transcriptional terminator, such as the rrnB terminator may be positioned between the 3′ end of the nucleic acid encoding the identifiable marker or selectable marker and the 5′ end of the regulatable fusion promoter.

[0376] The promoter replacement method may be performed in any desired organism. For example, in some embodiments, the method may be practiced in Gram-positive bacteria. In certain embodiments, the cell to be used with any of the methods described herein is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species.

[0377] In some embodiments, the promoter replacement method is conducted in cells which have an increased frequency of homologous recombination. For example, in some embodiments, the method is conducted in cells which carry the sbcA and recBC mutations, or in cells having mutations which confer similar phenotypes. In other embodiments, the cells may express one or more proteins in the λ Red recombination system. For example, in some embodiments the cells may constitutively or conditionally express the λ Gam, Bet, and Exo proteins or other proteins having similar functions. In other embodiments, the cells may constitutively or conditionally express the λ Bet protein or a protein having a similar function.

[0378] In some embodiments, a double stranded promoter replacement construct is introduced into a cell comprising the recBCD and sbcA mutations, recBC and sbcA mutations or mutations with analogous consequences. The double stranded promoter replacement construct may or may not comprise a gene encoding a selectable or identifiable marker, but in preferred embodiments the double stranded promoter replacement construct includes a gene enocding a selectable or identifiable marker. In other embodiments, a double stranded promoter replacement construct is introduced into a cell constitutively or conditionally expressing the recE and recT proteins or proteins with analogous functions. In still other embodiments, a double stranded promoter replacement construct is introduced into a cell constitutively or conditionally expressing the λ Gam, Bet, and Exo proteins or other proteins having similar functions. In such embodiments, cells in which the promoter replacement construct has integrated are identified by obtaining colonies which express the identifiable marker or by growing the cells in appropriate selective media. If desired, homologous integration may be confirmed by performing a Southern blot or amplification reaction using a probe or primers which will yield a known result if homologous recombination has occurred.

[0379] In other embodiments, a promoter replacement construct may be introduced into the cell as a single stranded DNA. Preferably, the single stranded promoter replacement construct does not comprise a gene which encodes an identifiable or selectable marker, but in some embodiments a gene which encodes an identifiable or selectable marker may be included in the construct. In some embodiments, the single stranded nucleic acid may be generated by in vitro synthesis. In other embodiments, a circular single stranded DNA, such as M13, comprising the promoter replacement construct may be fragmented by sonication and the fragments may be introduced into the cell. In still other embodiments, a single stranded promoter replacement construct is introduced into a cell constitutively or conditionally expressing the λ Bet protein, the Bet and Gam proteins, the Bet, Gam and Exo proteins or other proteins having similar functions. Cells in which integration has occurred at the desired location are identified by performing colony hybridization, amplification reactions, or other methods familiar to those skilled in the art.

[0380] As discussed above, the 5′ and 3′ portions of the linear promoter replacement construct which have homology to the promoter region of the target gene may have any length which permits homologous recombination to occur. Accordingly, in some embodiments, the 5′ and 3′ regions of the amplicon which have homology to the promoter region of the target gene in the chromosome of the organism, may each be more than 120 nucleotides in length, at least 120 nucleotides in length, at least 100 nucleotides in length, at least 90 nucleotides in length, at least 80 nucleotides in length, at least 70 nucleotides in length, at least 60 nucleotides in length, at least 50 nucleotides in length, at least 40 nucleotides in length, at least 30 nucleotides in length or less than 30 nucleotides in length.

[0381] In some embodiments, the promoter replacement construct is introduced into the cell as a linear nucleic acid. In particular, in some embodiments, the promoter replacement construct is generated by an conducting amplification reaction, such as a PCR reaction, to generate a linear amplicon. In some aspects of this embodiment, the PCR reaction is performed on a linearized plasmid. In a preferred aspect of this embodiment, the amplicon contains about 60 bp of homology to the target gene at both the 5′ and 3′ ends of the construct.

[0382] Thus, one embodiment of the invention relates to a method for identifying a gene which is required for proliferation of a cell comprising replacing the native promoter of a gene with a regulatable fusion promoter and identifying cells in which the extent of proliferation of the cell when the fusion promoter is active at a first level is substantially different than the extent of proliferation of the cell when the fusion promoter is active at a second level, the first level being greater than the second level. In some embodiments, the cell has an enhanced frequency of homologous recombination. In some embodiments, the fusion promoter is an inducible promoter. In other embodiments, the level of transcription from the promoter varies with the level of inducer present in the medium in which the cell is grown. In certain embodiments, the step of replacing the native promoter comprises introducing a linear nucleic acid comprising a 5′ portion homologous to a first portion of the native promoter, a 3′ portion homologous to a second portion of the native promoter and the regulatable promoter disposed between the 5′ portion and the 3′ portion into a cell such that homologous recombination occurs between the 5′ portion and the first portion of the native promoter and between the 3′ portion and the second portion of the native promoter. The nucleic acid may, in some embodiments, further comprise a gene encoding a selectable marker. The regulatable fusion promoter may be any of the fusion promoters described herein. In some embodiments, the 5′ portion comprises at least 60 nucleotides homologous to the first portion of the native promoter and the 3′ portion comprises at least 60 nucleotides homologous to the second portion of the native promoter. Certain embodiments include those in which the nucleic acid is linear. In some embodiments, the step of identifying cells comprises identifying cells which proliferate to a greater extent in the presence of inducer than in the absence of inducer. In other embodiments, the Shine Delgamo sequence of said native promoter is not replaced.

[0383]FIG. 3A illustrates one embodiment of the present promoter replacement method in which a linear double stranded promoter replacement construct comprising a nucleic acid encoding a selectable marker, a transcriptional terminator, and a regulatable fusion promoter is inserted into the chromosome of a cell expressing the RecE and RecT gene products of the Rac prophage or gene products having analogous functions. FIG. 3B illustrates one embodiment of the present promoter replacement method in which a linear single stranded promoter replacement construct comprising a 5′ portion homologous to a target sequence, a 3′ portion homologous to a target sequence and a regulatable fusion promoter is inserted into the chromosome of a cell expressing the λ Beta and Gam proteins.

[0384] As discussed below, the promoter replacement method may also be used to obtain cells in which the level or activity of a gene product required for proliferation is reduced. Such cells may be used in cell-based assays to identify compounds which inhibit proliferation.

[0385] In another embodiment of the present invention, genes required for proliferation are identified by methods in which a an operator is integrated into the promoter of a target gene or into the target gene itself via homologous recombination. The operator may be any nucleic acid which allows the activity or level of the gene product encoded by the target gene to be modulated. For example, in some embodiments, the operator may be recognized by a repressor. In such methods, a linear operator insertion construct which comprises a 5′ portion homologous to a first portion of the target promoter or gene, a 3′ portion homologous to a second portion of the target promoter or gene and an operator disposed between the 5′ portion and the 3′ portion is introduced into a cell containing the target gene. After integration of the operator, expression of the target gene is under the control of the operator. For example, in some embodiments, the operator may be recognized by a transcriptional repressor. Numerous operators recognized by transcriptional repressors, such as the operators described herein, including the lac operator, the tet operator, the xyl operator, λc1 operator, the mal operator, or the trp operator, may be used.

[0386] After insertion of the operator in the target promoter or gene via homologous recombination, the extent of proliferation of cells in which the operator has been inserted under conditions in which the promoter is active at a first level is compared to the extent of proliferation of the cells under conditions in which the promoter is active at a second level which is lower than the first level. If the candidate gene is required for proliferation, the extent of proliferation of the cells grown under conditions where the promoter is active at the higher level will be greater than the extent of proliferation of cells grown under conditions where the promoter is active at the lower level.

[0387] The operator may be integrated at any location in which the activity or level of the gene product encoded by the target gene is modulated by the operator. For example, the operator may be integrated at any location in which expression of the target gene will be regulated by the effector which acts on the operator. For example, if the operator is integrated into a bacterial cell, the operator may be placed upstream of the −35 box, between the −35 box and −10 box, or downstream of the −10 box (e.g.overlapping the transcription start site). In some embodiments, the operator is integrated so as to maintain the appropriate relationships on the DNA helix between the transcription factors which direct expression from the promoter which is placed under the control of the operator.

[0388] As discussed above, preferably the cells into which the operator is to be integrated have an enhanced frequency of homologous recombination. In some embodiments, the operator may be introduced into the cell as linear double stranded DNA. Such DNA may be generated by an amplification reaction, such as a PCR reaction. In some aspects of this embodiment, the PCR reaction is performed on a linearized plasmid. In such embodiments, the cells may constitutively or conditionally express the λ Red recombinase system, comprising the Gam, Exo, and Bet proteins or an analogous system from another phage. Alternatively, the cells may comprise the recBCD and sbcA mutations, recBC and sbcA mutations or analogous mutations. In some embodiments, the cells constitutively or conditionally express the recE and recT proteins or proteins with analogous functions. In some embodiments, the cells comprise the recBCD and sbcA mutations, recBC and sbcA mutations, or analogous mutations, and constitutively or conditionally express the recE and recT proteins, or analogous proteins.

[0389] In some embodiments, the operator may be introduced into the cells as linear single stranded DNA. Such DNA may be synthesized in vitro or generated by sonication of a circular single stranded nucleic acid such as an M13 construct comprising the operator insertion construct. In some embodiments, the cells may constitutively or conditionally express the Bet protein of the λ Red recombinase system or an analogous protein from another phage. In still further embodiments, the expression of the proteins in the λ Red recombinase system, the Bet protein from the λ Red recombinase system, or analogous proteins from other phage may be conditional.

[0390] In some embodiments, the cells may be any of the cells described above which have an increased frequency of homologous recombination or any other cells having an increased frequency of homologous recombination which are familiar to those of skill in the art.

[0391] In some embodiments, the 5′ and 3′ portions of the linear operator insertion construct which have homology to the promoter region of the target gene or the target gene may have any length which permits homologous recombination to occur. Accordingly, in some embodiments, the 5′ and 3′ regions of the construct which have homology to the promoter region of the target gene or the target gene, may each be more than 120 nucleotides in length, at least 120 nucleotides in length, at least 100 nucleotides in length, at least 90 nucleotides in length, at least 80 nucleotides in length, at least 70 nucleotides in length, at least 60 nucleotides in length, at least 50 nucleotides in length, at least 40 nucleotides in length, at least 30 nucleotides in length or less than 30 nucleotides in length. For example, in some embodiments, the 5′ and 3′ regions of the construct which have homology to the target promoter or gene may be about 30 nucleotides in length.

[0392] Alternatively, in some embodiments, the operator insertion construct may be a single stranded nucleic acid, such as a synthetic oligonucleotide. Preferably, the single stranded nucleic acid is introduced into cells constitutively or conditionally expressing a protein which stabilizes single stranded DNA, such as the λ Bet protein or an analogous protein, as described in Ellis et al., PNAS 98: 6742-6746, 2001, the disclosure of which is incorporated herein by reference in its entirety. After introduction of the operator insertion construct into a suitable cell, cells in which the operator has integrated by homologous recombination are identified. The cells may be identified by performing an amplification reaction or Southern blot. Alternatively, to identify cells in which expression of a gene required for proliferation is regulated by the inserted operator, cells which grow well under conditions in which the promoter which is regulated by the operator is active at a higher level but not under conditions in which the promoter is active at a lower level are identified. For example, cells which grow on media containing an inducer which increases the level of transcription from the promoter but not on media in which transcription is repressed may be identified. In some embodiments, the operator may be the lac operator and cells which grow on media containing IPTG but not on media lacking IPTG may be identified. In some embodiments, the operator may be the tet operator and cells which grow on media containing tetracycline but not on media lacking tetracycline may be identified. In some embodiments, the operator may be the xyl operator and cells which grow on media containing xylose but not on media lacking xylose may be identified. In some embodiments, the operator may be the mal operator and cells which grow on media containing maltose but not on media lacking maltose may be identified. In some embodiments, the operator may be the trp operator and cells which grow on media lacking tryptophan but not on media containing tryptophan may be identified. In some embodiments, the operator may be the λc1 operator and cells which grow at the premissive temperature but not at the restrictive temperature may be identified.

[0393] The operator insertion method provides a fast and inexpensive way to engineer directed insertions into the chromosome. In some embodiments, the operator insertion construct may lack a nucleic acid encoding an identifiable or selectable marker. The method allows insertions of only the desired nucleic acid sequence without the necessity of introducing additional nucleic acid sequences such as sequences encoding identifiable or selectable markers. In addition, in embodiments where the native promoter is placed under the control of an operator, the expression level under conditions in which the promoter is not repressed or activated will be the physiological level.

[0394] The operator insertion method may be performed in any desired prokaryotic organism. For example, in some embodiments, the method may be practiced in Gram-positive bacteria. In certain embodiments, the cell to be used with any of the methods described herein is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species.

[0395] In some embodiments, the operator insertion method is conducted in cells which have an increased frequency of homologous recombination. For example, in some embodiments, the method is conducted in cells which carry the sbcA and recBC mutations, or in cells having mutations which confer similar phenotypes. In other embodiments, the cells may constitutively or conditionally express one or more proteins in the λ Red recombination system. For example, in some embodiments the cells may constitutively or conditionally express the λ Gam, Bet, and Exo proteins or other proteins having similar functions. In other embodiments, the cells may constitutively or conditionally express the λ Bet protein or a protein having a similar function.

[0396] In some embodiments, the promoter region for a target gene/operon is identified either experimentally or bioinformatically. An oligonucleotide is designed which matches at least 90, at least 70, at least 60, or at least 50 residues of the promoter region. An operator is engineered to be present in the middle of the oligonucleotide sequence. Thus, the operator will have at least 45, at least 35, at least 30, or at least 25 nucleotides of homology to the target region on each side. Examples of operators include the lac, tet, xyl, mal, trp, or λc1 operators, though any other operator known in the art can function. The oligonucleotide can be designed such that the operator is inserted: i) upstream of the −35 box, ii) in between the −35 and the −10 box, or iii) downstream of the −10 box.

[0397] In some embodiments, the oligonucleotides are introduced into cells that conditionally or constitutively express the λ Red (or homologous phage) recombination system, which consists of the λ Gam, Exo, and Bet proteins (Ellis et al., PNAS 98:6742-6746 (2001)). This recombination system efficiently recombines oligonucleotide DNA into the bacterial chromosome. Specifically, in some embodiments, the cells constitutively or conditionally express the λ Bet protein, which is a single strand binding protein that promotes annealing of complementary single strand and can mediate strand exchange, or another protein with an analogous function. Transformants are then plated on non-selective plates. To ascertain if a transforming is a recombinant, it is replica plated on plates that contain and lack the appropriate inducer. The plates are screened for an expected phenotype. For example, if the xyl operator is inserted downstream of the −10 box of an essential gene, then one would predict that recombinants would only be able to grow in the presence of inducer (xylose). For genes of known or expected phenotypes, e.g., those resulting in growth, transformants could be rapidly screened by plating on inducing (de-repressing) plates and then replica plating transformants on plates containing or lacking inducer. Those transformants that do not grow in the absence of inducer would contain the desired operator insertion.

[0398] Recombinants could also be verified by Southern or PCR analysis. This method is amenable to high throughput and thus many targeted recombinants could be constructed in a short time.

[0399] The operator insertion method also has the advantage of being portable to other bacteria by conditionally inducing the expression of the λ Bet protein. Additionally, this method could be employed to test if a gene is essential by screening for conditional growth, as discussed herein.

[0400] In some embodiments, the operator insertion method is conducted in cells which have an increased frequency of homologous recombination. For example, in some embodiments, the method is conducted in cells which carry the sbcA and recBC mutations, or in cells having mutations which confer similar phenotypes. In other embodiments, the cells may constitutively or conditionally express one or more proteins in the λ Red recombination system. For example, in some embodiments the cells may constitutively or conditionally express the λ Gam, Bet, and Exo proteins or other proteins having similar functions. In other embodiments, the cells may constitutively or conditionally express the λ Bet protein or a protein having a similar function.

[0401] In some embodiments, a double stranded operator insertion construct is introduced into a cell comprising the recBCD and sbcA mutations, recBC and sbcA mutations or mutations with analogous consequences. The double stranded operator insertion construct may or may not comprise a gene encoding a selectable or identifiable marker, but in preferred embodiments the double stranded operator insertion construct includes a gene enocding a selectable or identifiable marker. In other embodiments, a double stranded operator insertion construct is introduced into a cell constitutively or conditionally expressing the recE and recT proteins or proteins with analogous functions. In still other embodiments, a double stranded operator insertion construct is introduced into a cell constitutively or conditionally expressing the λ Gam, Bet, and Exo proteins or other proteins having similar functions. In such embodiments, cells in which the operator insertion construct has integrated are identified by obtaining colonies which express the identifiable marker or by growing the cells in appropriate selective media. If desired, homologous integration may be confirmed by performing a Southern blot or amplification reaction using a probe or primers which will yield a known result if homologous recombination has occurred.

[0402] In other embodiments, a single stranded operator insertion construct is introduced into the cell. Preferably, the single stranded operator insertion construct does not comprise a gene which encodes an identifiable or selectable marker, but in some embodiments a gene which encodes an identifiable or selectable marker may be included in the construct. In some embodiments, the single stranded nucleic acid may be generated by in vitro synthesis. In other embodiments, a circular single stranded DNA, such as M13, comprising the operator insertion construct may be fragmented by sonication and the fragments may be introduced into the cell. In still other embodiments, a single stranded operator insertion construct is introduced into a cell constitutively or conditionally expressing the λ Bet protein, the Bet and Gam proteins, the Bet, Gam and Exo proteins or other proteins having similar functions. Cells in which integration has occurred at the desired location are identified by performing colony hybridization, amplification reactions, or other methods familiar to those skilled in the art.

[0403]FIG. 4A illustrates one embodiment of the operator insertion method in which a linear double stranded regulatory element insertion construct comprising a nucleic acid encoding a selectable marker, a transcriptional terminator, and an operator is inserted into the chromosome of cells expressing the RecE and RecT gene products of the Rac prophage or gene products having analogous functions. FIG. 4B illustrates one embodiment of the operator insertion method in which a linear single stranded regulatory element insertion construct comprising a 5′ portion homologous to a target sequence, a 3′ portion homologous to a target sequence and an operator is inserted into the chromosome of a cell expressing the λ Beta and Gam proteins.

[0404] As discussed below, the operator insertion method may also be used to obtain cells in which the level or activity of a gene product required for proliferation is reduced. Such cells may be used in cell-based assays to identify compounds which inhibit proliferation.

[0405] In still another embodiment, a vector containing an outward-directed inducible promoter is randomly integrated into the genome of bacteria and clones containing these promoters that exhibit a reduction in viability and/or cell growth are identified. This technique is referred to as Random Insertional Cis-Antisense (random promoter insertion). This strategy involves the use of constructs that contain an inducible promoter (e.g., one of the promoters described herein) and a selectable marker to randomly integrate into the genome of a bacteria. When such constructs are integrated into or near an essential gene, induction of transcription from the inducible promoter will frequently reduce or eliminate normal expression of the essential gene. This event is particularly likely if the vector integrates such that transcription from the inducible promoter is in an antisense orientation relative to normal transcription of the endogenous gene. The integration in such cases is likely to be 3′ to the gene, within the 3′ untranslated region of the gene, within the coding region of the gene, or in the case of eukaryotes, within an intron of the gene. Induction of transcription in such integrants will dramatically interfere with normal sense transcription of the gene.

[0406] Since the expression of essential genes is required for proliferation of the organism, when such integration events occur within or near an essential gene, induction of transcription will be lethal or will severely impair proliferation of the organism. Such conditional lethal or conditional growth clones can be readily isolated and the genes into which the vector has integrated can be determined by the cloning and sequencing techniques described herein. In this manner, the random promoter insertion method can be used to discover essential bacterial genes.

[0407] Dominant negative mutants are another class of conditional lethal mutants that can be identified by using the random promoter insertion technique. These mutants can be generated when the vector integrates such that transcription from the inducible promoter is in the sense orientation relative to normal transcription of the endogenous gene. In this case the integration site can be 5′ to the gene, within the 5′ untranslated region of the gene, within the coding region of the gene, or in the case or eukaryotes, within an intron of the gene. Because the dominant negative mutants will involve certain classes of essential genes, the dominant negative mutants are useful for identifying a subset of essential genes. Such conditional dominant negative mutants can also be employed in a cell-based assay.

[0408] By one approach, a vector having the outward-directed inducible promoter is linearized by cutting with a restriction endonuclease and the linearized vector is transformed into bacteria. Sites of integration will be random but in many cases, the inducible promoter is oriented such that it produces an antisense RNA. Transformants are robotically picked, grown, and gridded onto inducing and non-inducing media (e.g., media having and lacking xylose, media having and lacking tetracycline, and media having and lacking maltose). The gridded plates are then screened for transformants that exhibit a growth defect on inducing media relative to the phenotype on non-inducing media. Transformants that have a reduced viability and/or cell growth in the presence of the inducer harbor a promoter that has integrated proximal to an essential gene. Once identified, the clone is sequenced to determine the identity of the essential gene. The genes that are affected by the random promoter insertion either directly or indirectly (e.g., by interacting with the RNA or protein that corresponds to the gene) causing a reduction in viability and/or cell growth are used as drug discovery targets for high throughput drug screening.

[0409] In some embodiments of the present invention, complementary to genes required for the proliferation of a host organism from which the antisense molecules were originally obtained may be used to identify homologous antisense nucleic acids and the coding nucleic acids complementary thereto from cells or microorganisms other than the original host organism, to inhibit the proliferation of cells or microorganisms other than the original host organism by inhibiting the activity or reducing the amount of the identified homologous coding nucleic acid or homologous polypeptide in the cell or microorganism other than the original host organism, or to identify compounds which inhibit the growth of cells or microorganisms other than the original host organism as described below. For example, antisense nucleic acids complementary to proliferation-required genes from original host organism may be used to identify compounds which inhibit the growth of Gram-positive organisms such as Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species.

[0410] In one embodiment of the present invention, antisense nucleic acids complementary to the sequences identified as required for proliferation or portions thereof are operably linked to a regulatable fusion promoter, such as one of the fusion promoters described herein, contained in a vectors capable of replicating within a species other than the species from which the proliferation-required sequences were obtained. Expression vectors which contain a fusion promoter and one or more origins of replication that are functional in one or more organisms, such as the expression vectors described herein, may be used. For example, the vector may be capable of replicating and/or producing transcripts in one or more of the following Gram-positive organisms selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species.

[0411] Following the subcloning of the antisense nucleic acids, which are complementary to proliferation-required sequences or portions thereof from the host organism from which the stabilized antisense nucleic acids were originally obtained, into an expression vector under control of a regulatable fusion promoter capable of replicating and producing stabilized transcripts in a second cell or microorganism of interest (i.e. a cell or microorganism other than the one from which the identified nucleic acids were obtained), the antisense nucleic acids are conditionally transcribed to test for bacterial growth inhibition. The coding sequences complementary to the antisense nucleic acids from the original host that, when transcribed, inhibit growth of the second cell or microorganism are compared to the known genomic sequence of the second cell or microorganism to identify the homologous gene from the second organism. If the homologous sequence from the second cell or microorganism is not known, it may be identified and isolated by hybridization to the proliferation-required sequence of interest from the original host organism or by amplification using PCR primers based on the proliferation-required nucleotide sequence of interest. In this way, nucleotide sequences which may be required for the proliferation of the second cell or microorganism may be identified. For example, the second microorganism may be selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species.

[0412] The homologous nucleic acid sequences from the second cell or microorganism which are identified as described above may then be operably linked to a promoter, such as one of the regulatable fusion promoter described herein, in an antisense orientation and introduced into the second cell or microorganism. The techniques described herein for identifying genes required for proliferation may thus be employed to determine whether the identified nucleotide sequences from a second cell or microorganism inhibit the proliferation of the second cell or microorganism. For example, the second microorganism may be selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species. If the stabilized antisense nucleic acid inhibits the proliferation of the second cell or microorganism, a cell-based assay, such as the one described herein, can be used to test and identify candidate antibiotic compounds.

[0413] In another embodiment of the present invention, screening of candidate antibiotic compounds can be performed directly by using the antisense molecule isolated from the original host organism. In this embodiment, an antisense nucleic acid comprising a nucleic acid complementary to the proliferation-required sequences from the original host organism or a portion thereof is transcribed in the second host from a regulatable fusion promoter such as one of the fusion promoters described herein. If the antisense molecule is transcribed so as to sufficiently alter the level or activity of a nucleic acid required for proliferation of the second host, the second host may be used directly in a cell-based assay, such as those described herein, to identify candidate antibiotic compounds.

[0414] The techniques for identifying drug discovery targets described above can be used to generate bacterial strains for high throughput drug screening methodologies. Because the promoters described herein can be tightly and/or finely regulated over a wide range of maximal and basal transcription levels, one can selectively synthesize a precise amount of transcript or peptide sufficient to sensitize the cells without achieving complete lysis of cell population. For example, bacterial strains comprising the fusion promoters described herein are created using the EGI, promoter replacement, operator insertion, and random promoter insertion techniques. By carefully adjusting the amount of inducer in the growth media, an optimal balance of slowed growth and sensitivity to compounds can be achieved. Libraries of test compounds are then screened against the sensitized cells, the effect on cell growth is monitored, and compounds that further reduce viability and/or cell growth are identified. Through subsequent rounds of drug design, medicinal chemistry, and analysis, optimal antibiotics are developed using this inventive method.

[0415] Other embodiments concern an in vivo system for determining whether a gene is required for infection of a host by a bacterium. Such a system can also be used for validating the drug discovery targets and lead compounds identified by the approaches described above. Accordingly, bacterial strains (e.g., including bacterial strains such as Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidernidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species) having essential genes under the control of the regulatable promoters described herein are created using the EGI technology, promoter replacement technology, operator insertion technology or the random promoter insertion technology and these bacteria are used to inoculate a suitable host animal (e.g., neutropenic mice). Two hours after inoculation, an experimental group of animals are either provided a solution containing inducer (if the EGI technology or random promoter insertion technology was used to create the compromised cells) or a solution containing a minimal amount of inducer (if the promoter replacement technology or operator insertion technology was used to create the compromised cells). Accordingly, the bacteria are induced to generate the transcript or peptide that interacts with the essential gene or gene product or transcription of an endogenous essential gene is reduced. Subsequently, the amount of viable bacteria remaining in the animal or tissue sample from the animal is determined. By using this approach, one can rapidly evaluate bacterial strains carrying essential genes that can be selectively downregulated in vivo. Additionally, one can use these animal models to screen lead compounds in vivo so as to identify new antibiotics.

EXAMPLE 1 Construction of Fusion Promoters Having a Single Operator

[0416] This example describes the construction of the fusion promters that can be used to drive transcription of an RNA and/or expression of a peptide or protein in bacteria in a tightly regulated fashion. Many of the promoters described herein can function in multiple bacterial species including Gram-positive organisms such as Staphylococcus and Enterococcus species.

[0417] For construction of the Xyl-T5 fusion promoter, which is functional, for example, in Staphylcoccus aureus, a S. aureus/E. coli shuttle vector, referred to as pRNLX2, was initially constructed. First, the multiple cloning site was removed from the S. aureus vector pRN5548, obtained from Richard Novick at the Skirball Institute at New York University, by digesting the vector with HindIII. The remaining vector sequence was self-ligated, which resulted in the formation of a new vector referred to as pRNHIII. Next, the pRNHIII vector was digested with SalI and the DNA was treated with Calf intestine alkaline phosphatase (CIP). Subsequently, the SalI linearized and CIP-treated pRNHIII vector was ligated to a 2.6 kb XhoI/SalI fragment of pLex5BA, which contains the plasmid origin and ampicillin resistance marker. (Diederich, L. et al., BioTechniques 16: 916-923 (1994), the disclosure of which is incorporated herein by reference in its entirety). This vector, referred to as pRNLX2, contains both a S. aureus and an E. coli origin of replication. Thus, the vector can replicate in both Gram-negative and Gram-positive bacteria.

[0418] A xyl expression system was then constructed using the pRNLX2 shuttle vector. First, the xyl repressor gene and promoter were obtained from the plasmid pWH942 using PCR with the flanking primers 5′TGGCATTCTACGACTATAAC 3′ (SEQ ID No.: 3) and 5′GCGAATTCTACATTTTAGTTGGTTAATTT 3′ (SEQ ID No.: 4). (Schnappinger, D. et al., FEMS Microbiology Letters 129: 121-128 (1995), the disclosure of which is incorporated herein by reference in its entirety). These primers contain EcoRI sites near their 5′ ends and result in a PCR amplicon with trimmable EcoRI sites. The amplicons were digested with EcoRI and, subsequently, ligated into the pRN5548 vector, which had been linearized by digestion with EcoRI and SwaI. The resulting plasmid, called pXylRN, was used as a source for the xyl fragment for the final construct. The xyl fragment was excised from pXylRN with EcoRI and PshAI and was ligated to pRNLX2 that had been linearized with EcoRI and PvuII. The structure of the resultant plasmid was confirmed by sequencing the insert from both sides.

[0419] In the final step, the xyl promoter from pKLO20 was removed and replaced by the Xyl-T5 promoter. The xyl promoter and operator of pKLO20 is flanked by an RsrII site at its 5′ end and the EcoR1site of the multiple cloning site at its 3′ end. The promoter and operator were removed by digesting the plasmid with RsrII and EcoRI. The larger vector fragment of this digestion was purified from the smaller promoter fragment by preparative gel electrophoresis and a pair of oligonucleotides were designed having the following sequences: 5′GACCGTCATAAAAAATTTATTTGCTTTCAGGAAAATTTTTCTGTATAATA (SEQ ID No.:5) GATTCAAGTTAGTTTGTTTATTAAATTAACCAACTAAAATGTAG3′ 5′AATTCTACATTTTAGTTGGTTAATTTAATAAACAAACTAACTTGAATCTAT (SEQ ID No.:6) TATACAGAAAAATTTTCCTGAAAGCAAATAAATTTTTTATGACG3′.

[0420] These oligonucleotides were annealed and, thereby, formed a double-stranded fragment. This fragment included: the RsrII sticky end (underlined), an AT rich region, the T5 promoter, the xylO, and the EcoRI sticky end (italicized). This fragment was then combined with the purified RsrII-EcoRI vector fragment of pKLO20 and ligated to form the pXyl-T5 vector.

[0421] To facilitate the construction of fusion promoters that function in Enterococcus faecalis, a reporter plasmid having a swapable promoter module located between a xyl repressor gene (xylR) and xyl operator (xylO) was designed. The Lactococcus lactis promoter probe vector pAK80 (described in Israelsen, et al. Appl. Environ. Microbiol. 61, 2540-47 (1995), the disclosure of which is incorporated by reference in its entirety), which contains replicons functional in both Escherichia coli (p15A origin) and Enterococcus faecalis (PCT 1138 replicon) as well as a selectable marker for erythromycin resistance (erm), was digested at the unique HindIII and BamHI polylinker sites just upstream from the promoterless Leuconostoc mesenteroides β-galactosidase reporter genes (lacL-lacM). After digestion, the 11 kb vetor fragment was purified from the 26 bp polylinker by preparative gel electrophoresis then ligated to a 1535 bp gel purified HindIII/BamHI fragment from pEP25 (SEQ ID NO: 7) which contains the CP25 promoter from Lactococcus lactis (SEQ ID NO.: 38) flanked upstream by the xylose repressor gene (xylR) and downstream by the xylose operator sequence (xylO). A portion of the resulting ligation mixture was transformed into competent Escherichia coli XL-1 Blue cells (Stratagene, La Jolla, Calif.) and 20 μl, 75 μl or 100 μl of the transformation mixture was plated on BYGT medium containing 20 μg/ml erythromycin. Isolated erythromycin-resistant transformants were picked and streaked to obtain single colony isolates. Plasmid DNA was then purified from representative single colony transformants. The presence of the chimeric plasmid pEPEF1 (SEQ ID NO.: 8) was confirmed by digesting purified plasmid preparations with HindIII/BamHI and detecting an excised 1535 bp fragment, which corresponds to the fragment containing the Xyl-CP25 fusion promoter, by gel electrophoresis.

[0422]FIG. 2 shows a region of pEPEF1 that contains the Xyl-CP25 fusion promoter. Immediately adjacent to the 5′ end of the CP25 promoter and just downstream of xylR lies a unique RsrII recognition site (element (a) in FIG. 2). A unique recognition site for XhoI (element (b) in FIG. 2) lies immediately adjacent to the 3′ end of the CP25 promoter just upstream from xylO. The placement of these two unique restriction sites allow the CP25 promoter to be removed from pEPEF1 and replaced with promoter sequences from Lactococcus or other microbial species. For example, the CP25 promoter can be replaced with promoters having SEQ ID NOs.: 36-45.

[0423] In one example, a pEPEF1 derivative was created by replacing the CP25 promoter with the P59 promoter from Lactococcus lactis. Two complementary DNA oligonucleotides corresponding to the sequence of the P59 promoter were synthesized (SEQ ID NOs.: 9 and 10). Each sequence included a 5′-overhang end complementary to either an RsrII or XhoI overhang. 5′GACCGACATTAAATTCTTGACAGGGAGAGATAGGTTTGATAGAATATAA (SEQ ID NO.:9) TAGTTGTC3′ 5′TCGAGACAACTATTATATTCTATCAAACCTATCTCTCCCTGTCAAGAATT (SEQ ID NO.:10) TAATGTCG3′

[0424] The underlined nucleotides comprise the 5′-overhang of the RsrII recognition site whereas the italicized nucleotides correspond to the 5′-overhang of the XhoI recognition site. The double stranded P59 RsrII/XhoI fragment was constructed by combining an equimolar amount of SEQ ID NOs.: 9 and 10, heating the mixture to 90° C. to denature any secondary structure, and allowing the complementary sequences to anneal by slowly cooling the solution to room temperature. The double stranded P59 promoter sequence was then ligated into gel-purified, RsrII/XhoI-digested pEPEF1 reporter vector thereby creating the Xyl-P59 fusion promoter. This reporter vector was termed pEPEF1-P59.

[0425] It will be appreciated that any promoter sequence, including the promoters of SEQ ID NOs.: 36-45, can be synthesized to include an RsrII overhang at one end and a XhoI overhang at the other. Promoter sequences can be wildtype promoters known from Lactococcus lactis and other Gram-positive species or promoters that have been modified to alter their transcription efficiency in gram-positive organisms. Example 7 provides a description of promoters that have been modified to alter their transcription efficiency in Staphylococcus aureus or Enterococcus faecalis.

[0426] The next example describes the construction of fusion promoters that have multiple operators.

EXAMPLE 2 Construction of Fusion Promoters Having a Plurality of Operators

[0427] This example describes the construction of fusion promoters comprising a plurality of operators. Fusion promoters such as Xyl/Lac-T5 and Xyl/Lac-P59 can be constructed by using a cloning strategy that is similar to the promoter swaping procedure described in Example 1. It will be appreciated that this procedure may be used to generate fusion promoters having two or more operators selected from the group consisting of xylO, tetO, trpO, malO, λc1O and lacO.

[0428] The lac operon is contained in many vectors including pGEM-11Zf(−), which is commercially available from Promega Corp. The sequence of the pGEM-11Zf(−) vector can be found in the NCBI database, Accession number X65314, the disclosure of which is incorporated herein by reference in its entirety. The lac operator comprises the 17 bp sequence: (TTGTTATCCGCTCACAA) (SEQ ID No. 11), which can be inserted in place of the 17 bp intervening sequence between the −35 Box and −10 Box of a variety of promoters, including the promoters of SEQ ID NOs.: 26-35.

[0429] In one example, a Xyl/Lac-T5 promoter can be made by replacing the 17 bp sequence, which comprises element (3) in FIG. 1, with the 17 bp lac operator sequence. To make the Xyl/Lac-T5 fusion promoter, two oligonucleotides are synthesized and annealed to create a double stranded fragment. The two oligonucleotides are provided below: 5′GACCGTCATAAAAAATTTATTTGCTTTGTGAGCGGATAACAATTATA (SEQ ID No.:12) ATAGATTCAAGTTAGTTTGTTTATTAAATTAACCAACTAAAATGTAG3′ 5′AATTCTACATTTTAGTTGGTTAATTTAATAAACAAACTAACTTGAATCT (SEQ ID No.:13) ATTATAATTGTTATCCGCTCACAAAGCAAATAAATTTTTTATGACG3′.

[0430] The resulting double stranded fragment includes: the RsrII sticky end (underlined portion of SEQ ID NO.: 12), an AT rich region found to increase the level of transcription from the T5 promoter in Gram-positive bacteria, the T5 promoter, the lac operator (lacO), the xylose operator (xylO), and the EcoRI sticky end (italicized portion of SEQ ID NO.: 13). This double standed fragment can be combined with the purified RsrII/EcoRI vector fragment of pKLO20 and ligated to form the vector pXyl/LacT5.

[0431] In a similar example, a Xyl/Lac-P59 promoter can be constructed by replacing the 17 bp sequence between the −35 Box and −10 Box regions (nucleotides 18-34) of SEQ ID NO.: 40) of P59 with the 17 bp lac operator sequence (SEQ ID NO.: 11). To make the Xyl/Lac-P59 fusion promoter, two oligonucleotides are synthesized and annealed to create a double stranded fragment. The two oligonucleotides are provided below: 5′GACCGACATTAAATTCTTGACATTGTGAGCGGATAACAATAGAATATAA (SEQ ID NO.:14) TAGTTGTC3′ 5′TCGAGACAACTATTATATTCTATTGTTATCCGCTCACAATGTCAAGAATT (SEQ ID NO.:15) TAATGTCG-3′.

[0432] The underlined nucleotides comprise the 5′-overhang of the RsrII recognition site whereas the italicized nucleotides correspond to the 5′-overhang of the XhoI recognition site. The double stranded P59/Lac fusion promoter formed by the annealing of these two oligonucleotides is then ligated with purified, RsrII/XhoI-digested pEPEF1, which contains the xylO operator, to form the vector pEPEF1-Lac/P59.

[0433] It will be appreciated that there are many equivalent techniques that can be used to construct fusion promoters having a plurality of operators. It will also be appreciated that operators other than lacO and promoters other than those described herein may be used in multi-operator constructs. For example, the fusion promoters may comprise a plurality of operators selected from the group consisting of xylO, tetO, trpO, malO, λC1O or lacO. Furthermore, one of ordinary skill in the art will recognize that an operator may be positioned in a various locations with respect to the promoter provided that the operator retains sufficient ability to regulate transcription.

[0434] The next example describes the construction of expression vectors containing regulatable fusion promoters.

EXAMPLE 3 Multipurpose Expression Vectors Containing Fusion Promoters

[0435] In this section, the construction and use of expression vectors comprising fusion promoters is described. Because these expression vectors can accept a wide range of fusion promoters each having a range of transcriptional activity, these vectors are useful for finely modulating the amount of nucleic acid or polypeptide expressed inside a host cell throughout a wide range of concentrations.

[0436] It will be appreciated that a regulatable fusion promoter suite for use in expression vectors can be constructed which provides a wide dynamic range of both basal and maximal transcriptional activities. Each fusion promoter within this suite can be tightly and finely regulated over its own characteristic range of transcriptional activity. Furthermore, the regulatable fusion promoters within the suite can comprise different promoters operably linked to the same operators thus providing for a common means of regulation of the promoters within the suite. For example, the promoters within the set of regulatable fusion promoters within the suite can be linked to at least one operator contained in a promoterless expression vector that is capable of replicating in one or more Gram-positive organisms. In other embodiments, a promoter operator fusion construct may be inserted into the vector as a cassette. Accordingly, a regulatable fusion promoter having an appropriate basal level and/or maximal level of transcriptional activity can be selected from the suite for use in the appropriate application. As described below, fusion promoters can be quickly and easily exchanged in the expression vector as needed.

[0437] Fusion promoters prepared as described in the preceding sections can be incorporated into existing expression vectors using conventional techniques in molecular biology. Embodiments include, for example, expression vectors having promoters of SEQ ID NOs.: 36-45 linked to operators such as xylO, xylO/lacO or one or more operators including, but not limited to tetO, malO, lacO, trpO, and λc1O. In some embodiments, the expression vectors comprising the fusion promoters allow for the transcription and/or expression of a peptide protein or transcript in a plurality of Gram-positive and Gram-negative bacterial species (e.g., Escherichia coli Staphylococcus species, and Enterococcus species).

[0438] These vectors can also comprise any of the following elements: genes encoding assayable markers such as enzymes that produce detectable products, genes encoding selectable markers such as polypeptides that control drug resistance, one or more unique restriction sites positioned downstream of the promoter such that inserts cloned into the site can be transcribed from the promoter, one or more transcription terminators such as a rho independent or rho dependent terminators which are positioned downstream of the unique cloning sites so as to terminate transcription of the nucleic acid insert, an origin of replication functional in Gram-negative species such as Escherichia coli, and an origin of replication functional in Gram-positive species such as Staphylococcus or Enterococcus.

[0439] For example, pXyl-T5 P15a is an expression vector which comprises the Xyl-T5 fusion promoter (see FIG. 5). This vector is 6858 bp in length and can be used to transcribe RNA and/or express a peptide or protein in Gram-positive bacteria including, but not limited to, Staphylococcus species. This vector comprises the Xyl-T5 fusion promoter positioned between the xylR gene and a multiple cloning site having unique restriction sites (e.g., EcoRI, SacI, XhoI, KpnI, SmaI, XmaI, BamHI, XbaI, and SalI sites), the beta-lactamase gene from pLex5BA the P15a replicon (which maintains the vector at approximately 15 copies/cell in E. coli), and the pC194 Staphylococcus replicon (which includes palA, palB, the pc194 origin of replication, the rep gene, and a chloramphenicol acetyl transferase gene that imparts chloramphenicol resistance in Gram-positive bacteria).

[0440] The pXyl-T5 p15a vector was constructed from pXyl-T5 by replacing the E. coli ColE1 origin of pXyl-T5, which allows replication of the vector at 100-200 copies per cell, with the P15a origin of pACYC184, which allows replication of the vector at approximately 15 copies per cell in E. coli. (Chang, A. C. Y. and Cohen, S. N., J. Bacteriol. 134:1141-1156 (1978), the disclosure of which is incorporated herein by reference in its entirety). The ColE1 origin lies within two closely linked NotI sites. To obtain the Xyl plasmid having the ColE1 origin, the pXyl-T5 vector was digested with NotI and the larger vector fragment was purified by preparative gel electrophoresis. Next, a derivative of plasmid pACYC184, which includes the P15a origin as a similar NotI cassette, was digested with NotI and the smaller P15a-encoding fragment was gel purified. (Diederich, L. et al., BioTechniques 16: 916-923 (1994), the disclosure of which is incorporated herein by reference in its entirety). The two purified NotI fragments were combined and ligated to form the vector pXyl-T5 P15a. Table 1 lists the components of the pXyl-T5 P15a vector and their locations. TABLE 1 Start End Type Position Position Name Description Gene  176 1327 xylR The gene encoding repressor protein XylR Promoter 1360 1455 A:T rich The modified T5 promoter T5 with upstream A:T rich box promotor Region 1455 1494 Multiple Multiple cloning site includes Cloning following unique restriction site sites: EcoRI, SacI, XhoI, KpnI, SmaI, XmaI, BamHI, XbaI, SalI Gene 2903 2043 ampR The beta-lactamase gene from pLex5BA Region 3037 3852 P15a A replicon that allows maintenance of vector at approximately 15 copies per cell in E. coli hosts. Inserted as NotI cassette at the Start and End numbers of the vector sequence. Region 4302 6659 S. aureus replicon from pC194. Includes the five features described below: Region 4462 4302 C palA With palB, imperfect direct repeats Gene 5242 4592 C Cat Gram-positive Chloramphen- icol acetyl transferase gene- imparts chloramphenicol resistance in S. aureus. Region 5562 5399 C ori pC194 origin of replication Gene 5699 6388 rep gene for rep protein Region 6659 6626 C palB

[0441] A plasmid map of pXyl-T5 P15a is provided in FIG. 5. The nucleotide sequence of pXyl-T5 P15a is provided in SEQ ID NO.: 16.

[0442] It will be appreciated that corresponding expression vectors which include fusion promoters comprising either single or multiple operators can be constructed using conventional techniques in molecular biology. For example, pXyl/Lac-T5 p15a can be constructed by digesting pXyl-T5 p15a with RsrII and EcoRI so as to remove the Xyl-T5 fusion promoter. After purification, the linear vector fragment is ligated to an RsrII/EcoRI fragment comprising the pXyl/Lac-T5 promoter the contruction of which was described in Example 2.

[0443] In another example, expression vectors functional in both Escherichia coli and Enterococcus faecalis were constructed by replacing the lacL-lacM reporter genes of pEPEF1 with the multiple cloning site (MCS) and rrnBtlt2 terminator of pLexP20. Specifically, pEPEF1 was digested with SmaI and SalI to remove the 3808 basepair fragment containing the lacL-lacM reporter genes. Following digestion, the large vector fragment was gel purified, the SalI overhanging end was made blunt with T4 DNA polymerase, and the fragment was circularized by ligating the blunt ends. The resulting plasmid, termed pEPEF2, was then digested with BamHI, gel purified, and blunted with T4 DNA polymerase. The ClaI/BstB1 fragment containing the MCS and rnnBtlt2 terminator was removed from pLexP20, gel purified, blunted using T4 DNA polymerase, then ligated to the blunt ended pEPEF2. A portion of the resulting ligation mixture was transformed into competent Escherichia coli and aliquots of the transformation mixture were plated on medium containing 20 μg/ml erythromycin. Isolated erythromycin-resistant transformants were picked and streaked to obtain single colony isolates. Plasmid DNA was then purified from representative single colony transformants. The presence and orientation of the MCS/terminator fragment was determined by PCR and DNA seqeuncing, respectively. A plasmid comprising the MCS/terminator fragment oriented such that the MCS was located adjacent to xylO was termed pEPEF3 (SEQ ID NO.: 17).

[0444] It will be appreciated that expression vectors such as those described above can be used to construct derivative expression vectors which include fusion promoters comprising either single or multiple operators. For example, the CP25 promoter can be removed from pEPEF3 by digestion of this vector with RsrII and XhoI. After purification, the linear vector fragment is ligated to an RsrII/XhoI fragment comprising P59. A pEPEF based expression vector having a generic promoter, termed pEPEF-X is shown in FIG. 6. Accordingly, many expression vectors derived from either pXyl-T5 or pEPEF3 can be constructed by exchanging the promoter sequence contained therein with the desired promoter sequence. In some embodiments, an expression vector is constructed comprising a promoter selected from SEQ ID NOs.: 36-45. pEPEF-X based expression vectors containing the Lactococcus lactis derived promoter sequences described herein, include pEPEF9 (SEQ ID NO.: 46), pEPEF14 (SEQ ID NO.: 47), pEPEF14* (SEQ ID NO.: 48), P1P2/EF14 (SEQ ID NO.: 49), pEPEF18 (SEQ ID NO.: 50), pEPEF20 (SEQ ID NO.: 51), and pEPEF22 (SEQ ID NO.: 52).

[0445] Several methods of transcribing an RNA and/or expressing a peptide or protein are also embodiments included herein. More specifically, the fusion promoters described herein can be used to transcribe a sense or anti-sense RNA or can be used to express a recombinant peptide or protein in a cell.

[0446] The next example describes experiments that were performed to evaluate the ability of the Xyl-T5 promoter to transcribe RNA.

EXAMPLE 4 Activity of Fusion Promoters by Real-Time RT-PCR

[0447] To analyze the strength of the Xyl-T5 promoter that comprised the T5 promoter operably linked to an AT Box and an xylO operator, transcript accumulation experiments were conducted using real-time RT-PCR (Reverse Transcriptase Polymerase Chain Reaction). RT-PCR detects a specific RNA by PCR amplification of a reverse transcriptase synthesized cDNA copy of the RNA. With real time RT-PCR, the accumulation of PCR amplified DNA is measured at each amplification step using a specific fluorogenically labeled oligonucleotide probe. The probe anneals to the newly formed cDNA product and is subsequently cleaved by the 5′ nuclease activity of the Taq DNA Polymerase during DNA amplification. Cleavage of the probe results in an increase in fluorescent dye signal, which is used to monitor accumulation of the specific PCR product at each PCR amplification step. The PCR cycle time at which logarithmic amplification is reached is proportional to the initial amount of RNA template in the reaction.

[0448] The levels of transcription from the Xyl-T5 promoter were evaluated in the following experiment. Fresh overnight colonies of Staphylococcus aureus RN4420 containing plasmid a plasmid having the Xyl-T5 promoter were added to 10 ml of Luria Broth (LB) that was supplemented with 15 μg/ml chloramphenicol (LB CM15). The cells were grown at 37° C. with shaking until early log phase (OD600 between 0.1 and 0.2) at which point they were diluted {fraction (1/10)} into pre-warmed LB CM15. After one hour of growth at 37° C. with shaking, three 1 ml aliquots of cells were pelleted and quick frozen on dry ice. The remaining suspension was split into two flasks. Xylose was added to a final inducer concentration of 2.0% in one flask, whereas the second flask remained non-induced (control). The resulting transcript is approximately 300 nucleotides in length and corresponds to vector sequence.

[0449] Immediately after xylose induction (2 minute time point) and at 10, 20, 30 and 60 minute time points, (three) 1 ml aliquots of cells were pelleted from each flask (with and without xylose) and quick frozen on dry ice. Cell pellets were stored at −80° C. Total RNA was purified from each sample using a Qiagen 96 well RNeasy kit. Approximately, 1 ng of RNA (two samples for each time point) was used as template for RT-PCR with an oligonucleotide primer set (forward and reverse primers, and a fluorogenic probe) for specific detection of the 3′ tail of the Xyl-T5 transcript. The reaction consisted of 300 nM each of the forward and reverse primers; 100 nM fluorogenic probe; 0.75 U AmpliTaq Gold (PE Biosystems); 7.5 U MultiScribe Reverse Transcriptase (PE Biosystems), 1× Buffer A (PE Biosystems); 5.8 mM MgCl, and 333 nM of each dNTP. The RT-PCR conditions were 48° C. for 30 minutes, 95° C. for 10 minutes, followed by 40 cycles of 95° C. for 15 seconds and 60° C. for 1 minute. RT-PCR reactions and detection were done in an ABI Prism 7700 Sequence Detection System.

[0450] The results of the experiment described above are shown in FIG. 7. Within 2 minutes of xylose induction, greater than a 30 fold excess of RNA was detected, as compared to the non-induced controls. At 60 minutes, greater than a 130 fold excess of RNA was detected in the xylose induced samples, as compared with non-induced bacteria. This example demonstrates that the fusion promoters described herein can be used to efficiently transcribe an RNA. Further, the data described herein verify that in the absence of xylose or an analog thereof Xyl-T5 promoter-driven transcription can be effectively repressed.

[0451] It will be appreciated that the fusion promoters described above can be used to construct a regulatable fusion promoter suite which provides a wide dynamic range of both basal and maximal transcriptional activities. Each fusion promoter within this suite can be tightly and finely regulated over its own characteristic range of transcriptional activity. Furthermore, the regulatable fusion promoters within the suite can comprise different promoters operably linked to the same operators thus providing for a common means of rapidly regulating the promoters within the suite. In some embodiments, the promoters within the set of regulatable fusion promoters within the suite can be linked to at least one operator contained in a promoterless expression vector that is capable of replicating in one or more Gram-positive organisms. Accordingly, a regulatable fusion promoter having an appropriate basal level and/or maximal level of transcriptional activity can be selected from the suite for use in the appropriate application.

[0452] The example below demonstrates the relative strengths of the fusion promoters described herein.

EXAMPLE 5 Activity of Fusion Promoters by Expression of Reporter Genes

[0453] This example describes the use of reporter gene constructs to measure the activity of fusion promoters such as those constructed in Examples 1 and 2. In one experiment, the pXyl-T5 and pXyl-T5 P15a vectors were tested to determine the relative level of expression of a reporter gene in both Escherichia coli and Staphylococcus aureus. To test the ability of the Xyl-T5 promoter to induce transcription and express a heterologous polypeptide, a construct was made in which the reporter gene was operably linked to the Xyl-T5 promoter. A pXyl-T5 vector was digested with SacI and SalI, and the linearized vector was purified by preparative gel electrophoresis. The linearized vector fragment was then ligated with a fragment containing the reporter gene. This construct was named pXyl-T5-Reporter. The pXyl-T5-Reporter vector containing the reporter gene under the control of the Xyl-T5 promoter was then introduced into S. aureus and the cells were cultured in the presence and absence of 2.0% xylose. The transcription and translation of the gene product encoded by the reporter gene was evident when xylose was provided in the growth medium but in the absence of an agent that inhibits the binding of the xylose repressor to the xylose operator, production of the gene product encoded by the reporter gene was not detectable.

[0454] When the vector containing the reporter gene under the control of the Xyl-T5 promoter was transformed in Escherichia coli, it was noted that expression of the gene product encoded by the reporter gene was very high, and uncontrollable either in the presence or absence of xylose. This may be because Escherichia coli does not express the XylR repressor. In those embodiments where it is desirable that Escherichia coli serve as a surrogate host, pXyl-T5 p15a, which has a reduced copy number in Escherichia coli can be used.

[0455] To evaluate the effect of the P15a origin on the level of reporter gene expression in Staphylococcus aureus and Escherichia coli in the presence and absence of xylose, a construct similar to pXyl-T5-Reporter was made, using the pXyl-T5 p15a vector. This construct was denoted pXyl-T5 p15a-Reporter. A comparison of the induction of reporter gene expression of pXyl-T5-Reporter and pXyl-T5 p15a-Reporter showed that the two vectors functioned identically when transformed and expressed in Staphylococcus aureus, but the P15a variant showed a markedly reduced expression of the reporter gene when transformed in Escherichia coli. The demonstrates that the p15a origin of replication decreases reporter gene expression by reducing the copy number of the expression vector thereby reducing the number Xyl-T5 promoters in the cell.

[0456] Fusion promoters contained on pEPEF1 and its derivatives were evaluated in determine their strength in Enterococcus faecalis by measuring the β-galactosidase activity produced by expression of the lacL-lacM reporter genes under induced and uninduced conditions. In one experiment, derivatives of pEPEF1 were prepared by replacing CP25 with the Lactococcus lactis derived promoters P32 (SEQ ID NO.: 39), P59 (SEQ ID NO.: 40), or P1P2 (SEQ ID NO.: 42) as described in Example 1. pEPEF1 and the resulting derivatives were each separately transformed into electrocompetent Enterococcus faecalis and the cells were cultured in the absense and the presence of 5% xylose. The β-galactosidase activity produced by the expression of the reporter genes was measured essentially as described in Israelsen et al. Appl. Environ. Microbiol. 61:2540-47 (1995), the disclosure of which is incorporated herein by reference in its entirety, with the following modifications. After incubation with or without xylose for four hours, the cells were harvested by centrifugation then resuspended in 1 ml of Z buffer comprising 0.06 M Na₂HPO₄.7H₂O, 0.04 M NaH₂PO₄.H₂O, 0.01 M KCl, 0.001 M MgSO₄.7H₂O, 0.05 M β-mercaptoethanol, at pH 7.0, with 0.6 ml of glass beads (150-212 micron diameter). The cells were lysed using a bead beater at maximun speed for 1 minute. After centrifugation, the supernatant was removed, vortexed with one drop of 0.1% SDS and 2 drops of chloroform at high speed for 10 seconds, then incubated with 200 μl of a 4 mg/ml o-nitrophenyl-β-D-galactopyranoside (ONPG) solution at 37° C. After approximately 20 minutes, the reaction was stopped by adding 250 μl of 1M sodium carbonate. β-galactosidase activity was quantitated spectrophotometrically. All assays were performed in triplicate.

[0457]FIG. 8 displays the amount of β-galactosidase activity produced by each fusion promoter in Enterococcus faecalis in the presence and absence of xylose. Promoters CP25 and P32 both produce very little β-galactosidase activity in the absence of xylose; however, in the presence of 5% xylose, both of these promoters express enough β-galactosidase to produce approximately 25 Miller units of activity. The activities of both P59 and P1P2 are much greater than that of either CP25 or P32. In the presence of xylose, P59 and P1P2 each produce between 125 and 150 Miller units of β-galactosidase activity. In the absence of xylose, P59 is strongly repressed having less than 4% of its induced activity. By contrast, P1P2 produces about 20% of its induced activity even in the absence of xylose.

[0458] The above data shows that the fusion promoters described herein differ in the levels of transcription which they provide and the levels of transcription under non-inducing conditions. Some promoters are highly active under induced conditions whereas others show only a modest increase in activity upon induction. Furthermore, some promoters described herein may be strongly repressed in the absence of xylose whereas the activity of other promoters may only be partially inhibited.

[0459] It will be appreciated that the fusion promoters described above can be used to construct a regulatable fusion promoter suite which provides a wide dynamic range of both basal and maximal transcriptional activities. Each fusion promoter within this suite can be tightly and finely regulated over its own characteristic range of transcriptional activity. Furthermore, the regulatable fusion promoters within the suite can comprise different promoters operably linked to the same operators thus providing for a common means of regulation of the promoters within the suite. In some embodiments, the promoters within the set of regulatable fusion promoters within the suite can be linked to at least one operator contained in a promoterless expression vector that is capable of replicating in one or more Gram-positive organisms. Accordingly, a regulatable fusion promoter having an appropriate basal level and/or maximal level of transcriptional activity can be selected from the suite for use in the appropriate application.

[0460] In another experiment, the activity of the CP25 fusion promter was compared to the activity of Xyl-T5. To produce a pXyl-T5 p15a vector containing the lacL-lacM reporter, the purified 4 kb HindIII/SalI fragment of pAK80 which contains the lacL-lacM reporter construct was ligated into purified pXyl-T5 p15a which had been linearized using HindIII and SalI. The resulting constuct was termed pXyl-T5 p15a LacLM. To compare the activity of the T5 fusion promoter with that of CP25, pXyl-T5 p15a LacLM and pEPEF1 were transformed separately into Enterococcus faecalis and the cells were cultured in the presence of 1% and 5% xylose and in its absence. The β-galactosidase activity produced by the expression of the reporter genes was measured using the Miller assay as described above.

[0461]FIG. 9 displays the amount of β-galactosidase activity produced by each fusion promoter under induced and uninduced conditions. In the absence of xylose, both the fusion promoters are strongly repressed in Enterococcus faecalis. When induced with 1% xylose, CP25 has about twice the activity of Xyl-T5 in Enterococcus faecalis. At 5% xylose, CP25 produces about 35 Miller units of activity compared to about 22 Miller units by Xyl-T5.

[0462] To determine whether there is a difference in the level of gene expression from pXly-T5 p15a in Staphylococcus aureus and Enterococcus faecalis, pXyl-T5 p15a LacLM was also transformed into Staphylococcus aureus which was grown in the presence of 1% and 5% xylose and in its absensce. The β-galactosidase activity produced by the expression of the reporter genes was measured using the Miller assay as described above.

[0463]FIG. 9 shows that the activity of Xyl-T5 is similar in Staphylococcus aureus and Enterococcus faecalis. When induced with 5% xylose in each organism, the Xyl-T5 fusion produces appoximately 22 Miller units of β-galactosidase activity.

[0464] The above data shows that the fusion promoters disclosed herein have the ability to function in a variety of Gram-positive organisms and to maintain their characteristic activities when transferred from one host to another. This result is in constrast to the alteration in activity that is seen when the promoter is transferred to Gram-negative organisms. One example is the unregulated activity of Xyl-T5 in in Escherichia coli.

[0465] The example below demonstrates that the activity of the fusion promoters described herein can be finely regulated over a range of inducer concentrations. In other words, the activity of these fusion promoters is titratable.

EXAMPLE 6 Titratable Expression from Fusion Promoters

[0466] This example shows that the activity of promoters described herein is titratable with inducer thus allowing a reproducible, incremental modulation of promoter activity by altering the concentration of inducer. Each fusion promoter described herein possesses its own characteristic activity over a range of xylose concentrations. Within this range, the amount of RNA transcribed from a fusion promoter can be finely regulated in repsonse to the concentration of xylose used to induce the promoter.

[0467] In one example, the derivative of pEPEF1 containing the P59 promoter (pEPEF1-P59) was transformed into Enterococcus faecalis and the transformants were cultured in the presence of increasing concentrations of xylose (from 0.1% to 10%) and in its absense. The β-galactosidase activity produced by the expression of the lacL-lacM reporter genes was measured using the Miller assay as described in Example 5. FIG. 10 shows that the activity of the P59 promoter increases in response to induction with increasing concentrations of xylose.

[0468] The next example describes modifications to fusion promoters that effect their transcriptional activity.

EXAMPLE 7 Modifications of Fusion Promoters that Alter Transcription Efficiency

[0469] This example describes the construction of fusion promoters that have been modified to have altered transcriptional activity in Gram-positive organisms. More specifically, modified fusion promoters, which are derived from wildtype promoters, have been constructed by creating mutations in the promoter sequence which either increase or diminish the strength of the promoter.

[0470] In one example, the strong Lactococcus lactis promoter, PL (SEQ ID NO.: 44), was modified to increase its activity in Gram-positive organisms. First, a regulatable PL fusion promoter was constructed by linking the PL promoter to a xylose operator using the methods described previously. Examination of the PL sequence revealed that changing the C at position 45 of SEQ ID NO.: 44 to an A would make the −10 Box region of promoter PL identical to −10 Box consensus site reported for Gram-positive organisms (TATAAT). PL10, a derivative of PL which incorporates this single basepair modification, was constructed by annealing the complementary synthetic oligonucleotides of SEQ ID NOs.: 18 and 19 using the procedure described in Example 1. 5′GACCGTTTCGTGAACTTTTTTGTTGACAAAGATAAAAACACATGATATA (SEQ ID NO.:18) ATTAAATCAC3′ 5′TCGAGTGATTTAATTATATCATGTGTTTTTATCTTTGTCAACAAAAAAGTT (SEQ ID NO.:19) CACGAAACG3′

[0471] The underlined nucleotides comprise the 5′-overhang of the RsrII recognition site whereas the italicized nucleotides correspond to the 5′-overhang of the XhoI recognition site. The double stranded promoter formed by the annealing of these two oligonucleotides was then ligated with purified, RsrII/XhoI-digested pEPEF1 to form a pEPEF1 derivative containing the Xly-PL10 fusion promoter. This derivative was termed pEPEF1-PL10. To determine the effect of this modification, the strengths of PL10, PL and P59 were compared by separately transforming pEPEF1 derivatives containing the appropriate promoter into Enterococcus faecalis then culturing the cells in the absense or presence of 5% xylose. The β-galactosidase activity produced by the expression of the lacL-lacM reporter genes was measured using the Miller assay as described in Example 5.

[0472]FIG. 11 compares the strengths of the PL, PL10 and P59 fusion promoters expressing the lacL-lacM reporter genes. When induced with 5% xylose, the activity of PL10 is about 20% greater than that of the wildtype PL promoter. Both PL and PL10 have substantially greater activity than P59 under induced conditions. In the absence of xylose, P59 is almost completely repressed whereas both PL and PL10 show significant basal activity.

[0473] In another example, P59 was modified to increase its activity in Gram-positive organisms. Examination of the P59 sequence revealed that changing the G at position 37 of SEQ ID NO.: 40 to a T would make the −10 Box region of promoter P59 identical to −10 Box consensus site in Gram-positive organisms. Additionally it appeared that addition of an AT rich region upstream of the −35 Box might enhance promoter activity. A derivative of P59 which incorporates these modifications was constructed by annealing the complementary synthetic oligonucleotides of SEQ ID NOs.: 20 and 21 using the procedure described in Example 1. 5′GACCGAAAAATGACAGTTTATTCTTGACAGGGAGAGATAGGTTTGATAT (SEQ ID NO.:20) AATATAATAGTTGTC3′ 5′TCGAGACAACTATTATATTATATCAAACCTATCTCTCCCTGTCAAGAATA (SEQ ID NO.:21) AACTGTCATTTTTCG3′

[0474] The underlined nucleotides comprise the 5′-overhang of the RsrII recognition site whereas the italicized nucleotides correspond to the 5′-overhang of the XhoI recognition site. The double stranded promoter formed by the annealing of these two oligonucleotides was then ligated with purified, RsrII/XhoI-digested pEPEF1, transformed into E. faecalis, and assayed as previously described. When compared to the unmodified P59, the derivative had about 20% higher activity.

[0475] It will be appreciated that modified fusion promoters, such as those described above, can be used to construct a regulatable fusion promoter suite which provides a wide dynamic range of both basal and maximal transcriptional activities. Modifying regulatable fusion promoters so as to increase their activity expands the range of both basal and maximal transcriptional activities of a suite of regulatable fusion promoters. Each modified fusion promoter within this suite can be tightly and finely regulated over its own characteristic range of transcriptional activity. Furthermore, the modified regulatable fusion promoters within the suite can comprise different promoters operably linked to the same operators thus providing for a common means of regulation of the promoters within the suite. In some embodiments, the modified promoters within the set of regulatable fusion promoters within the suite can be linked to at least one operator contained in a promoterless expression vector that is capable of replicating in one or more Gram-positive organisms. Accordingly, a regulatable fusion promoter having an appropriate basal level and/or maximal level of transcriptional activity can be selected from the suite for use in the appropriate application.

[0476] In yet another example, the Xly-T5 fusion promoter was modified to decrease its activity in Gram-positive organisms. By changing four nucleotides in the −35 Box of SEQ ID NO.: 36 (G to T at postion 18, C to A at position 19, T to C at position 20 and T to A at position 21) and one in the −10 Box of SEQ ID NO.: 36 (A to T at position 43) the Xyl-T5 promoter could be altered to include −10 and −35 elements that are identical to those of the weak xylA promoter. (For a description of the xylA promoter, see Sizemore et al., Mol. Gen. Genet. 227:377-384 (1991) and Schnappinger, D. et al., FEMS Microbiology Letters 129: 121-128 (1995), the disclosures of which are incorporated herein by reference in their entireties). Xyl-T5-DD, a derivative of Xyl-T5 which incorporates these modifications was constructed by annealing the complementary synthetic oligonucleotides of SEQ ID NOs.: 22 and 23 using the procedure described in Example 1. 5′GACCGTCATAAAAAATTTATTTTACATCAGGAAAATTTTTCTGTATATTA (SEQ ID NO.:22) GATTCAAGTTAGTTTGTTTATTAAATTAACCAACTAAAATGTAG3′ 5′AATTCTACATTTTAGTTGGTTAATTTAATAAACAAACTAACTTGAATCTA (SEQ ID NO.:23) ATATACAGAAAAATTTTCCTGATGTAAAATAAATTTTTTATGACG3′

[0477] The underlined nucleotides comprise the 5′-overhang of the RsrII recognition site whereas the italicized nucleotides correspond to the 5′-overhang of the EcoRI recognition site. The double stranded promoter formed by the annealing of these two oligonucleotides was then ligated with purified, RsrII/EcrRI-digested pXyl-T5-Reporter (prepared as described in Example 5). The pXyl-T5-Reporter vector was then introduced into S. aureus and the cells were cultured in the presence and absence of 2.0% xylose and expression levels were measured. Xyl-T5-DD had only about 10% of the activity of unmodified Xyl-T5. The nucleotide sequence of pXyl-T5-DD P15a, a Xyl-T5-DD based expression vector, is provided in (SEQ ID NO.: 53).

[0478] It will be appreciated that modified fusion promoters, such as those described above, can be used to construct a regulatable fusion promoter suite which provides a wide dynamic range of both basal and maximal transcriptional activities. Modifying regulatable fusion promoters so as to decrease their activity expands the range of both basal and maximal transcriptional activities of a suite of regulatable fusion promoters. Each modified fusion promoter within this suite can be tightly and finely regulated over its own characteristic range of transcriptional activity. Furthermore, the modified regulatable fusion promoters within the suite can comprise different promoters operably linked to the same operators thus providing for a common means of regulation of the promoters within the suite. In some embodiments, the modified promoters within the set of regulatable fusion promoters within the suite can be linked to at least one operator contained in a promoterless expression vector that is capable of replicating in one or more Gram-positive organisms. Accordingly, a regulatable fusion promoter having an appropriate basal level and/or maximal level of transcriptional activity can be selected from the suite for use in the appropriate application.

[0479] It will be appreciated that the methods described herein can be used to create promoter sequence modifications that either enhance or reduce promoter activity. These modifications can be made either alone or in combination. Among these changes are modifications of nucleotides within the promoter's −35 and −10 Box elements which make the promoter more or less like the consensus −35 Box and −10 Box sequences (TTGACA and TATAAT, respectively). Other modifications include altering the length of the region between the operator or transcription start site and the −10 Box, altering the length and/or composition of the region between the −10 Box and the −35 Box, and altering the length and/or composition of the AT rich region upstream of the −35 Box (i.e. the −45 Box). It will also be appreciated that modifications which affect promoter activity can include addition of elements such as the CRE region for catabolite repression. These additional elements can be further modified so that they are more or less like their corresponding consensus sequence. Promoters which incorporate one or more modifications similar to those described above are represented by SEQ ID NOs.: 37, 41, 43 and 45.

[0480] The experiments described in the Examples 4-7 verify that the fusion promoters disclosed herein can be used to tightly and finely regulate transcription of an RNA or expression of a peptide or a protein throughout a range of activity levels.

[0481] It will be appreciated that both modified and unmodified fusion promoters, such as those described above, can be used to construct a regulatable fusion promoter suite which provides a wide dynamic range of both basal and maximal transcriptional activities. Modifying regulatable fusion promoters so as to increase or decrease their activity expands the range of both basal and maximal transcriptional activities of a suite of regulatable fusion promoters. The addition of unmodified fusion promoters having unique basal and/or maximal transcriptional activities also expands the dynamic range of the suite. Each regulatable fusion promoter within this suite can be tightly and finely regulated over its own characteristic range of transcriptional activity. Furthermore, the regulatable fusion promoters within the suite can comprise different promoters operably linked to the same operators thus providing for a common means of regulation of the promoters within the suite. In some embodiments, the promoters within the set of regulatable fusion promoters within the suite can be linked to at least one operator contained in a promoterless expression vector that is capable of replicating in one or more Gram-positive organisms. Accordingly, a regulatable fusion promoter having an appropriate basal level and/or maximal level of transcriptional activity can be selected from the suite for use in the appropriate application. Such a suite of promoters is useful in the methods of identifying genes required for proliferation or methods of identifying compounds which inhibit proliferation described herein.

[0482] The examples below discusses methods that use fusion promoters to identify essential genes and compounds having potential antibiotic activity.

EXAMPLE 8 Identification of Genes Required for Cellular Proliferation by Expressing Proliferation-Inhibiting Antisense Nucleic Acids from a Fusion Promoter

[0483] This example describes the examination of a library of candidate antisense nucleic acid operably linked to fusion promoters to identify essential genes in Staphylococcus aureus and Enterococcus faecalis. Although the following examples utilize the fusion promoters Xyl-T5, CP25 and P59, it will be appreciated that any of the fusion promoters described herein may be used.

[0484] It will be appreciated that a regulatable fusion promoter suite which provides a wide dynamic range of both basal and maximal transcriptional activities can be used to facilitate the discovery of proliferation-required genes. Each fusion promoter within this suite can be tightly and finely regulated over its own characteristic range of transcriptional activity. Furthermore, the regulatable fusion promoters within the suite can comprise different promoters operably linked to the same operators thus providing for a common means of regulation of the promoters within the suite. In some embodiments, the promoters within the set of regulatable fusion promoters within the suite can be linked to at least one operator contained in a promoterless expression vector that is capable of replicating in one or more Gram-positive organisms. Accordingly, a regulatable fusion promoter having an appropriate basal level and/or maximal level of transcriptional activity can be selected from the suite based on the expression level of the proliferation-required genes that are targeted for discovery.

[0485] In the methods for identifying genes encoding gene products required for cellular proliferation using antisense nucleic acids expressed from an inducible fusion promoter, random genomic fragments are obtained from the organism in which it is desired to identify genes required for cellular proliferation. The random genomic fragments may be generated by a partial digestion with a restriction enzyme, mechanical shearing, using techniques such as sonication and nebulization, or DNAseI digestion. The random genomic fragments are operably linked to a regulatable fusion promoter in a vector, such as one of the expression vectors described herein. In those instances where the inserted genomic fragments are in an antisense orientation with respect to the promoter, the transcript produced is complementary to at least a portion of an mRNA encoding a gene product such that they interact with sense mRNA produced from various genes and thereby decrease the translation efficiency or the level of the sense messenger RNA (mRNA) thus decreasing production of the protein encoded by these sense mRNA molecules. In cases where the sense mRNA encodes a protein required for proliferation, cells grown under inducing conditions fail to grow or grow at a substantially reduced rate. Additionally, in cases where the transcript produced is complementary to at least a portion of a non-translated RNA and where that non-translated RNA is required for proliferation, cells grown under inducing conditions also fail to grow or grow at a substantially reduced rate. In contrast, cells grown under non-inducing conditions grow at a normal rate.

[0486] The genes to which the antisense nucleic acids are complementary are then identified and utilized in the methods of the present invention. Thus, to identify genes required for cellular proliferation, the extent of proliferation of cells containing the vectors in the presence of an inducer which induces transcription from the regulatable fusion promoter is compared to the extent of proliferation of cells in the absence of the inducer. Those cells which grow well in the absence of the inducer but exhibit significantly reduced proliferation in the presence of the inducer contain a vector encoding an antisense nucleic acid complementary to at least a portion of a gene required for cellular proliferation.

[0487] Use of the above method to identify genes required for cellular proliferation in E. coli, Staphylococcus aureus, Salmonella typhimurium, Klebsiella pneumoniae, Pseudomonas aeruginosa and Enterococcus faecalis has been described in the following U.S. patent applications, the disclosures of which are incorporated herein by reference in their entireties: U.S. patent application Ser. No. 09/815,242, filed Mar. 21, 2001; U.S. patent application Ser. No. 09/492,709, filed Jan. 27, 2000; U.S. patent application Ser. No. 09/711,164, filed Nov. 9, 2000; U.S. patent application Ser. No. 09/741,669, filed Dec. 19, 2000; U.S. patent application Ser. No. 09/815,242, filed Mar. 21, 2001; and U.S. patent application Ser. No. 09/948,993, filed Sep. 6, 2001. The methods used to identify genes required for cellular proliferation in Staphylococcus aureus, and Enterococcus faecalis are summarized below.

[0488] Nucleic acids involved in proliferation of Staphylococcus aureus were identified using the EGI technique as follows. A shotgun library of Staphylococcus aureus genomic fragments was cloned into the vector pXyIT5-P15a, which harbors the Xyl-T5 inducible promoter. The vector was linearized at a unique BamHI site immediately downstream of the XyIT5 promoter/operator. The linearized vector was treated with shrimp alkaline phosphatase to prevent reclosure of the linearized ends. Genomic DNA isolated from Staphylococcus aureus strain RN450 was fully digested with the restriction enzyme Sau3A or alternatively, partially digested with DNase I and “blunt-ended” by incubating with T4 DNA polymerase. Random genomic fragments between 200 and 800 base pairs in length were selected by gel purification. The size-selected genomic fragments were added to the linearized and dephosphorylated vector at a molar ratio of 0.1 to 1, and ligated to form a shotgun library.

[0489] The ligated products were transformed into electrocompetent E. coli strain XL1-Blue MRF (Stratagene) and plated on LB medium with carbenicillin at 100 μg/ml. Resulting colonies numbering 5×10⁵ or greater were scraped and combined, and were then subjected to plasmid purification.

[0490] The purified library was then transformed into electrocompetent Staphylococcus aureus strain RN4220. Resulting transformants were plated on LBG agar with chloramphenicol at 15 μg/ml in order to generate 100 to 150 platings at 500 colonies per plating. The colonies were subjected to robotic picking and arrayed into wells of 384 well culture dishes. Each well contained 100 μl of LBG+CM15 liquid medium. Inoculated 384 well dishes were incubated 16 hours at 37° C., and each well was robotically gridded onto solid LBG+CM15 medium with or without 2% xylose. Gridded plates were incubated 16 hours at 37° C., and then manually scored for arrayed colonies that were growth-compromised in the presence of xylose. Such growth sensitivity occurred at a frequency of one in sixty arrayed colonies.

[0491] Arrayed colonies that were growth-sensitive on medium containing 2% xylose, yet were able to grow on similar medium lacking xylose, were subjected to further growth sensitivity analysis as follows: Colonies from the plate lacking xylose were manually picked and inoculated into individual wells of a 96 well culture dish containing LBG+CM15, and were incubated for 16 hours at 37° C. These cultures were robotically diluted {fraction (1/100)} into fresh medium and allowed to incubate for 4 hours at 37° C., after which they were subjected to serial dilutions in a 384 well array and then gridded onto media with and without 2% xylose. After growth for 16 hours at 37° C., the arrays of serially diluted spots that resulted were compared between the two media. Spots that grew similarly on both media were scored as a negative and corresponding colonies were no longer considered. Spots on xylose medium that failed to grow to the same serial dilution compared to those on the non-xylose plate were given a score based on the differential, i.e. should the spots on xylose only grow to a serial dilution of −4 while they were able to grow to −8 on the non-xylose plate, then the corresponding transformant colony received a score of “4” representing the log difference in growth observed.

[0492] Nucleic acids involved in proliferation of E. faecalis were identified as follows. A shotgun library of E. faecalis genomic fragments was cloned into either pEPEF3 or pEPEF14, which contain the CP25 or P59 promoter, respectively, regulated by the xyl operator/repressor. The vector was linearized at a unique SmaI site immediately downstream of the promoter/operator. The linearized vector was treated with alkaline phosphatase to prevent reclosure of the linearized ends. Genomic DNA isolated from E. faecalis strain OG1RF was partially digested with DNase I and “blunt-ended” by incubating with T4 DNA polymerase. Random genomic fragments between 200 and 800 base pairs in length were selected by gel purification. The size-selected genomic fragments were added to the linearized and dephosphorylated vector at a molar ratio of 2 to 1, and ligated to form a shotgun library.

[0493] The ligated products were transformed into electrocompetent E. coli strain TOP10 cells (Invitrogen) and plated on LB medium with erythromycin (Erm) at 150 μg/mL. Resulting colonies numbering 5×10⁵ or greater were scraped and combined, and were then subjected to plasmid purification.

[0494] The purified library was then transformed into electrocompetent E. faecalis strain OG1RF. Resulting transformants were plated on Todd-Hewitt (TH) agar with erythromycin at 10 μg/mL in order to generate 100 to 150 platings at 500 colonies per plating. The colonies were subjected to robotic picking and arrayed into wells of 384 well culture dishes. Each well contained 100 μL of THB+Erm 10 μg/mL. Inoculated 384 well dishes were incubated 16 hours at room temperature, and each well was robotically gridded onto solid TH agar+Erm with or without 5% xylose. Gridded plates were incubated 16 hours at 37° C., and then manually scored for arrayed colonies that were growth-compromised in the presence of xylose.

[0495] Arrayed colonies that were growth-sensitive on medium containing 5% xylose, yet were able to grow on similar medium lacking xylose, were subjected to further growth sensitivity analysis. Colonies from the plate lacking xylose were manually picked and inoculated into individual wells of a 96 well culture dish containing THB+Erm 10, and were incubated for 16 hours at 30° C. These cultures were robotically diluted {fraction (1/100)} into fresh medium and allowed to incubate for 4 hours at 37° C., after which they were subjected to serial dilution on plates containing 5% xylose or plates lacking xylose. After growth for 16 hours at 37° C., the arrays of serially diluted spots that resulted were compared between the two media. Colonies that grew similarly on both media were scored as a negative and corresponding colonies were no longer considered. Colonies on xylose medium that failed to grow to the same serial dilution compared to those on the non-xylose plate were given a score based on the differential. For example, colonies on xylose medium that only grow to a serial dilution of −4 while they were able to grow to −8 on the non-xylose plate, then the corresponding transformant colony received a score of “4” representing the log difference in growth observed.

[0496] Of the 1.4 million Enterococcus faecalis colonies screened using the CP25 fusion promoter, about 1 of every 3750 colonies had reduced proliferation ability whereas of the 390,000 colonies screened using the P59 fusion promoter, about 1 of every 390 colonies had reduced proliferation ability.

[0497] Thus, it will be appreciated that use of a regulatable fusion promoter from the suite of regulatable fusion promoters which has a higher level of transcriptional activity may increase the frequency at which proliferation-required genes are identified and may allow identification of proliferation-required genes which would not be identified using weaker promoters. Such a fusion promoter would be expected to produce a larger number of colonies having reduced proliferation ability per colonies screened. Furthermore, proliferation-required genes from new gene classes would be expected.

[0498] Expression vectors were purified from Staphylococcus aureus or Enterococcus faecalis colonies identified as having reduced proliferation upon induction of the fusion promoter. The nucleic acids contained on the expression vectors under control of the fusion promoter were isolated for subsequent nucleotide sequence determination and further characterization.

[0499] It will be appreciated that the random genomic fragments used in the above methods can be generated using restriction enzymes and endonucleases other than those specifically described or by any other procedure. In addition, random genomic fragments may be generated by mechanical shearing. Sonication and nebulization are two such techniques commonly used for mechanical shearing of DNA.

[0500] The next example describes experiments that use the Rapid Replacement of Genomic Promoters technique to identify essential genes for use as targets for drug discovery.

EXAMPLE 9 Replacement of Genomic Promoters for Proliferation-Required Genes with Inducible Fusion Promoters

[0501] Nucleic acids involved in proliferation of Staphylococcus aureus or Enterococcus faecalis are identified using the Rapid Replacement of Genomic Promoters technique described in U.S. patent application Ser. No. 09/948,993, filed Sep. 6, 2001, the disclosure of which is incorporated herein by reference in its entirety, and U.S. Provisional Patent Application Serial No. 60/230347, filed Sep. 6, 2000, the disclosure of which is incorporated herein by reference in its entirety. Initially, a target gene is selected. The gene may be a gene which is known or suspected to be essential for proliferation or a gene which has not been characterized with respect to its essentiality for proliferation. Identification of the essential gene can be accomplished by using the EGI technique or other methods including, but not limited to, genes shown to be essential in the literature or genes of unknown essentiality that are predicted to be essential by bioinformatics. Additionally, one can target essential operons containing multiple genes.

[0502] Next, the native promoter of the target gene or operon is replaced with a regulatable promoter, such as the fusion promoters described herein using “ET cloning” technology. Briefly, this is accomplished by engineering the one of the fusion promoters described herein such that it is 3′ of a selectable marker (e.g., a CAT gene). This entire cassette is amplified by PCR. This PCR product becomes the template for another round of PCR using primers with 80 bp of homology to the target promoter and 20 bp of homology to the fusion promoter-containing cassette. However, as little as 60 bp homology to the target promoter can be used. The region of homology is chosen such that the cassette will replace the promoter of the target gene without effecting the endogenous Shine-Delgamo sequence. This construct is then transformed into competent cells containing various levels of inducer (e.g. xylose or analog thereof) to assure that the correct level of expression is achieved to allow survival. In some embodiments, the native promoter in a strain having an enhanced frequency of homologous recombination may be replaced with a regulatable promoter.

[0503] The correct integration of the cassette can be confirmed by colony PCR as described in U.S. patent application Ser. No. 09/948,993, filed Sep. 6, 2001, and U.S. Provisional Patent Application entitled Rapid PCR Method for Determination of Whether a Gene is Essential, Serial No. 60/230347, filed Sep. 6, 2000, attorney docket number ELITRA.022PR, the disclosures of which are incorporated herein by reference in their entireties. Preferably, correct integrants are tested for growth defects as inducer is titrated away. The inability to grow in the absence of inducer will confirm that the target was essential. These strains can then be induced to a level determined to be the optimal balance between slowed growth and sensitivity to inhibition of the target protein for use in a cell-based assay for compounds with antibiotic activity or may be used to identify genes required for proliferation as described herein.

[0504] The next example describes experiments that use an operator insertion technique to identify essential genes for use as targets for drug discovery.

EXAMPLE 10 Regulation of Genomic Promoters for Proliferation-Required Genes by Operator Insertion

[0505] In this example, insertion of an operator into the promoter of a proliferation-required gene is described. A target promoter which drives the expression of a target gene or operon is identified. The target gene may be a gene which is known to be required for proliferation, suspected to be required for proliferation, or a gene which has not been characterized with respect to whether it is required for proliferation. An oligonucleotide comprising a xyl, tet, trp, mal, λc1 or lac operator flanked on each side by 40 nucleotides homologous to the target promoter is synthesized. The 40 nucleotide flanking sequences are determined based on the desired location for intergration of the xyl operator into the promoter. The single stranded oligonucleotide construct is then transformed into a bacterium having an enhanced frequency of homologous recombination. For example, the bacterium may express the λ Beta and Gam proteins. The cells in the transformation mixture are diluted and plated on medium containing xylose. Colonies in which the operator has integrated into the target promoter are identified by colony PCR. The identified colonies are grown in medium containing or lacking inducer. The colonies proliferate on medium containing inducer but fail to grow on medium lacking inducer, thereby indicating the target promoter directs transcription of a gene encoding a gene product required for proliferation. Strains in which a gene encoding a gene product required for proliferation is under the control of the xyl, tet, trp, mal, λc1 or lac operator may also be used to identify compounds which inhibit proliferation in the cell-based asays described herein.

[0506] It will be appreciated that operators other than the xyl, tet, trp, mal, λc1 or lac operator can be used with the methods described above.

[0507] The next example describes experiments that use the Random Insertional Cis-Antisense technique to identify essential genes for use as targets for drug discovery.

EXAMPLE 11 Insertional Inactivation of Proliferation-Required Genes by Random Insertion of an Inducible Fusion Promoter

[0508] Nucleic acids involved in proliferation of Staphylococcus or Enterococcus are identified using the Random Insertional Cis-Antisense technique described in U.S. Provisional Patent Application Serial No. 60/230,403, filed Sep. 6, 2000, the disclosure of which is incorporated herein by reference in its entirety. In one embodiment, a vector containing an outward-directed inducible fusion promoter is randomly integrated into the genome of a Gram-positive organism such as S. aureus or E. faecalis or the other Gram-positive organisms described herein and clones containing promoters in cis-antisense orientation with regard to essential genes are identified. Accordingly, a vector containing an inducible promoter, such as one of the fusion promoters described herein, is first engineered such that the inducible promoter reads into a multiple cloning site (MCS). The genetic marker is juxtaposed directly 3′ of the MCS. This positioning helps ensure that the transforming vector will not be degraded by exonucleases (transformants in which the vector is degraded will not be recovered because the genetic marker will be lost). The vector is linearized by cutting with any single restriction enzyme in the MCS. Using different enzymes to linearize may increase the randomness of integration of the vector. The linearized vector is then transformed into the appropriate bacterial strain.

[0509] The intergrative transformation frequency of linear DNA varies widely among bacterial strains. In some strains, linear vectors usually transform integratively at a significantly higher frequency than circular vectors. This is especially true in bacterial strains that have been modified to have an enhanced frequency of homologous recombination. In strains in which the frequency of integrative transformation of linear vectors is low, circular vectors often transform integratively at a higher frequency. The mechanism of integration is largely unknown but it is contemplated that integration occurs via nonhomologous recombination. Sites of integration appear to be entirely random. Transformants are robotically picked, grown, and are robotically gridded onto inducing and noninducing (inducing or not inducing transcription of the promoter) media. These gridded plates are then screened for transformants that have a pronounced growth defect on inducing media relative to the phenotype on noninducing media. Such transformants may have the vector integrated into their genome in several possible locations in an essential gene: the 3′ untranslated region or any part of the coding or noncoding region that does not result in disruption of function of the gene product. In most cases the inducible promoter is oriented such that it produces an antisense RNA from the target gene. Thus, when the promoter is induced the expression of the essential gene is reduced. Such inhibition can occur by one of several mechanisms including binding of the antisense RNA to the mRNA, colliding RNA polymerases, RNA interference (RNAi) or a combination of such phenomenon.

[0510] The identity of the essential gene is then determined by sequencing the junction of the vector with the essential gene, the junction where the inducible promoter is juxtaposed with genomic DNA. This assay can also be used to create bacterial strains for cell-based assays that evaluate candidate antibiotic compounds as described herein.

[0511] The next example describes approaches that were used to determine the identity of the newly discovered essential genes.

EXAMPLE 12 Nucleotide Sequence Determination of Genes that are Affected by Proliferation-Inhibiting Antisense Nucleic Acids Expressed from Fusion Promoters

[0512] With reference to experiments that were performed using the antisense technique, this example describes approaches that were used to determine the nucleic acid sequence of newly discovered essential genes. It should be understood that the teachings herein can also be used to determine the nucleic acid sequence of essential genes identified using the promoter replacement, operator insertion or Random Insertional Cis-Antisense technique.

[0513] Plasmids from transformant colonies that received a dilution plating score of “2” or greater were isolated to obtain the genomic DNA insert responsible for growth inhibition as follows. Staphylococcus aureus were grown in standard laboratory media (LB or TB with 15 ug/ml Chloramphenicol to select for the plasmid). Growth was carried out at 37° C. overnight in culture tubes or 2 ml deep well microtiter plates. Lysis was performed as follows. Cultures (2-5 ml) were centrifuged and the cell pellets resuspended in 1.5 mg/ml solution of lysostaphin (20 μl/ml of original culture) followed by addition of 250 μl of resuspension buffer (Qiagen). Alternatively, cell pellets were resuspended directly in 250 μl of resuspension buffer (Qiagen) to which 5-20 μl of a 1 mg/ml lysostaphin solution were added. DNA was isolated using Qiagen miniprep kits or Wizard (Qiagen) miniprep kits according to the instructions provided by the manufacturer.

[0514] The genomic DNA inserts were then amplified from the purified plasmids by PCR as follows. Approximately, 1:1 of Qiagen purified plasmid was put into a total reaction volume of 25:1 Qiagen Hot Start PCR mix. The following primers were used in the PCR reaction: pXylT5F: CAGCAGTCTGAGTTATAAAATAG (SEQ ID NO:24) LexL TGTTTTATCAGACCGCTT (SEQ ID NO:25)

[0515] PCR was carried out in a PE GenAmp with the following cycle times:

[0516] Step 1. 95° C. 15 min

[0517] Step 2. 94° C. 45 sec

[0518] Step 3. 54° C. 45 sec

[0519] Step 4. 72° C. 1 minute

[0520] Step 5. Return to step 2 (29 times)

[0521] Step 6. 72° C. 10 minutes

[0522] Step 7. 4° C. hold

[0523] The PCR products were cleaned using Qiagen Qiaquick PCR plates according to the manufacturer's instructions. The amplified genomic DNA inserts were then subjected to automated sequencing.

[0524] For E. faecalis, plasmids from transformant colonies that received a dilution plating score of “2” or greater were isolated to obtain the genomic DNA insert responsible for growth inhibition as follows. E. faecalis were grown in THB 10 μg/ml Erm at 30° C. overnight in 100 ul culture wells in microtiter plates. To amplify insert DNA 2 ul of culture were placed into 25 μl Qiagen Hot Start PCR mix. PCR reactions were in 96 well microtiter plates. Primers to vector regions flanking the insert were used in the PCR reaction. PCR was carried out in a PE GenAmp with the following cycle times:

[0525] Step 1. 95° C. 15 min

[0526] Step 2. 94° C. 45 sec

[0527] Step 3. 54° C. 45 sec

[0528] Step 4. 72° C. 1 minute

[0529] Step 5. Return to step 2, 29 times

[0530] Step 6. 72° C. 10 minutes

[0531] Step 7. 4° C. hold

[0532] The PCR products were cleaned using Qiagen Qiaquick PCR plates according to the manufacturer's instructions. The purified PCR products were then directly cycle sequenced with Qiagen Hot Start PCR mix. PCR was carried out in a PE GenAmp with the following cycle times:

[0533] Step 1. 94° C. 15 min

[0534] Step 2. 96° C. 10 sec

[0535] Step 3. 50° C. 5 sec

[0536] Step 4. 60° C. 4 min

[0537] Step 5. Return to step 2, 24 times

[0538] Step 6. 4° C. hold

[0539] The PCR products were cleaned using Qiagen Qiaquick PCR plates according to the manufacturer's instructions. The amplified genomic DNA inserts were then subjected to automated sequencing.

[0540] The example below describes techniques that were used to compare the sequences identified by the approaches above to sequences present in publicly available databases so as to better characterize the essential genes.

EXAMPLE 13 Comparison of Isolated Nucleic Acids to Known Sequences

[0541] With reference to experiments that were performed using the antisense technique, this example describes approaches that were used to characterize the nucleic acid sequence of newly discovered essential genes in S. aureus and E. faecalis. It should be understood that the teachings herein can also be used to determine the nucleic acid sequence of essential genes in other Gram-positive cells and essential genes identified in Gram-positive cells using the promoter replacement, operator insertion or Random Insertional Cis-Antisense technique.

[0542] The nucleic acid sequences of the subcloned fragments from S. aureus or E. faecalis obtained from the expression vectors containing the Xyl-T5 and CP25 fusion promoters discussed above were compared to known sequences from S. aureus or E. faecalis and other microorganisms as follows. First, to confirm that each clone originated from one location on the chromosome and was not chimeric, the sequences of the selected clones were compared against the S. aureus or E. faecalis genomic sequences to align the clone to the correct position on the chromosome. The NCBI BLASTN v 2.0.9 program was used for this comparison, and the incomplete Staphylococcus aureus genomic sequences licensed from TIGR, as well as the NCBI nonredundant GenBank database were used as the source of genomic data. The E. faecalis sequences were compared to a proprietary database.

[0543] The BLASTN analysis was performed using the default parameters except that the filtering was turned off. No further analysis was performed on inserts which resulted from the ligation of multiple fragments.

[0544] In general, antisense molecules and their complementary genes are identified as follows. First, all possible full length open reading frames (ORFs) are extracted from available genomic databases. Such databases include the GenBank nonredundant (nr) database, the unfinished genome database available from TIGR and the PathoSeq database developed by Incyte Genomics. The latter database comprises over 40 annotated bacterial genomes including complete ORF analysis. If databases are incomplete with regard to the bacterial genome of interest, it is not necessary to extract all ORFs in the genome but only to extract the ORFs within the portions of the available genomic sequences which are complementary to the clones of interest. Computer algorithms for identifying ORFs, such as GeneMark, are available and well known to those in the art. Comparison of the clone DNA to the complementary ORF(s) allows determination of whether the clone is a sense or antisense clone. Furthermore, each ORF extracted from the database can be compared to sequences in well annotated databases including the GenBank (nr) protein database, SWISSPROT and the like. A description of the gene or of a closely related gene in a closely related microorganism is often available in these databases. Similar methods are used to identify antisense clones corresponding to genes encoding non-translated RNAs.

[0545] In some cases, for example, the NCBI BLASTN 2.0.9 computer algorithm was used and the default parameters were used with the exception that filtering was turned off. The default parameters for the BLASTN and BLASTX analyses were:

[0546] Expectation value (e)=10

[0547] Alignment view options: pairwise

[0548] Filter query sequence (DUST with BLASTN, SEG with others)=T

[0549] Cost to open a gap (zero invokes behavior)=0

[0550] Cost to extend a gap (zero invokes behavior)=0

[0551] X dropoff value for gapped alignment (in bits) (zero invokes behavior)=0

[0552] Show GI's in deflines=F

[0553] Penalty for a nucleotide mismatch (BLASTN only)=!3

[0554] Reward for a nucleotide match (BLASTN only)=1

[0555] Number of one-line descriptions (V)=500

[0556] Number of alignments to show (B)=250

[0557] Threshold for extending hits=default

[0558] Perform gapped alignment (not available with BLASTX)=T

[0559] Query Genetic code to use=1

[0560] DB Genetic code (for TBLAST[nx] only=1

[0561] Number of processors to use=1

[0562] SeqAlign file

[0563] Believe the query defline=F

[0564] Matrix=BLOSUM62

[0565] Word Size=default

[0566] Effective length of the database (use zero for the real size)=0

[0567] Number of best hits from a region to keep=100

[0568] Length of region used to judge hits=20

[0569] Effective length of the search space (use zero for the real size)=0

[0570] Query strands to search against database (for BLAST[nx] and TBLASTX), 3 is both, 1 is top, 2 is bottom=3

[0571] Produce HTML output=F

[0572] Those ORFs found to be complementary to the cloned nucleic acid sequences discussed above were evaluated with regard to orientation. Antisense clones were identified as those clones for which transcription from the inducible promoter would result in the expression of an RNA antisense to the complementary ORF.

[0573] The gene descriptions in the PathoSeq database derive from annotations available in the public sequence databases described above. It will be appreciated that ORFs may also be identified using databases other than PathoSeq. For example, the ORFs may be identified using the methods described in U.S. Provisional Patent Application Serial No. 60/191,078, filed Mar. 21, 2000, the disclosure of which is incorporated herein by reference in its entirety.

[0574] In the antisense-based screening experiments in Enterococcus faecalis as described in Examples 8 and 9, 256 proliferation-inhibiting antisense sequences corresponding to 61 unique ORFs were discovered using the CP25 fusion promoter whereas 790 proliferation-inhibiting antisense sequences corresponding to 150 unique ORFs were discovered using the P59 fusion promoter.

[0575] The example below describes approaches that may be used to identify operons or downstream genes associated with the essential genes identified by the methods above.

EXAMPLE 14 Identification of Genes and their Corresponding Operons that are Affected by Proliferation-Inhibiting Antisense Nucleic Acids Expressed from Fusion Promoters

[0576] With reference to experiments that were performed using the antisense technique, this example describes approaches that may be used to identify operons or downstream genes associated with the newly discovered essential genes in S. aureus and E. faecalis. It should be understood that the teachings herein can also be used to identify operons or downstream genes associated with the essential genes in other Gram-positive cells and essential genes identified in Gram-positive cells using the promoter replacement, operator insertion or Random Insertional Cis-Antisense technique Once the genes involved in S. aureus and E. faecalis proliferation are identified as described above, the operons in which these genes lie may be identified by comparison with known microbial genomes. Since bacterial genes are expressed in a polycistronic manner, the antisense inhibition of a single gene in an operon might affect the expression of all the other genes on the operon or the genes downstream from the single gene identified. Accordingly, each of the genes contained within an operon may be analyzed for their effect on proliferation.

[0577] Operons are predicted by looking for all adjacent genes in a genomic region that lie in the same orientation with no large noncoding gaps in between. First, full length ORFs complementary to the antisense molecules are identified as described above. Adjacent ORFs are then identified and their relative orientation determined either by directly analyzing the genomic sequences surrounding the ORFs complementary to the antisense clones or by extracting adjacent ORFs from the collection obtained through whole genome ORF analysis described above followed by ORF alignment. Operons predicted in this way may be confirmed by comparison to the arrangement of the homologous genes in the Bacillus subtilis complete genome sequence, as reported by the genome database compiled at Institut Pasteur Subtilist Release R15.1 (Jun. 24, 1999) which can be found at http://bioweb.pasteur.fr/GenoList/SubtiList/. The Bacillus subtilis genome is the only fully sequenced and annotated genome from a Gram-positive microorganism, and appears to have a high level of similarity to Staphylococcus aureus both at the level of conservation of gene sequence and genomic organization including operon structure. Annotation of some of the DNA sequences in some of the aforementioned databases is lacking, but comparisons may be made to E. coli using tools such as BLASTX. Public or proprietary databases may be used to analyzed E. faecalis sequences as well as the databases listed above.

[0578] Once the full length ORFs and/or the operons containing them have been identified using the methods described above, they can be obtained from a genomic library by performing a PCR amplification using primers at each end of the desired sequence. Those skilled in the art will appreciate that a comparison of the ORFs to homologous sequences in other microorganisms will facilitate confirmation of the start and stop codons at the ends of the ORFs.

[0579] In some embodiments, the primers may contain restriction sites which facilitate the insertion of the gene or operon into a desired vector. For example, the gene may be inserted into an expression vector and used to express the proliferation-required protein as described below. Other methods for obtaining the full length ORFs and/or operons are familiar to those skilled in the art. For example, natural restriction sites may be employed to insert the full length ORFs and/or operons into a desired vector.

[0580] The next example describes approaches that may be used to identify individual genes within an operon that was characterized as being required for proliferation.

EXAMPLE 15 Identification of Individual Genes within an Operon Required for Proliferation

[0581] With reference to experiments that were performed using the antisense technique, this example describes approaches that may be used to identify individual genes within an operon that has been characterized as being required for proliferation of a Gram-positive organism such as S. aureus or E. faecalis or the other Gram-positive organisms listed herein. It should be understood that the teachings herein can also be used to identify individual genes within an operon that were characterized as being required for proliferation in other Gram-positive cells and can be used in conjunction with the promoter replacement, operator insertion or Random Insertional Cis-Antisense technique.

[0582] The strategy described in this example seeks to determine if a targeted gene within an operon is required for cell proliferation by replacing the targeted gene in the chromosome with an in-frame deletion of the coding region of the targeted gene. Deletion inactivation of a chromosomal copy of a gene in Staphylococcus aureus or Enterococcus faecalis can be accomplished by integrative gene replacement. The principles of this method were described in Xia, M., et al. 1999 Plasmid 42:144-149 and Hamilton, C. M., et al 1989. J. Bacteriol. 171: 4617-4622, the disclosures of which are incorporated herein by reference in their entireties. In this approach, a mutant allele of the targeted gene is constructed by way of an in-frame deletion and introduced into the chromosome using a suicide vector. This results in a tandem duplication comprising a deleted (null) allele and a wild type allele of the target gene. Cells in which the vector sequences have been deleted are isolated using a counter-selection technique. Removal of the vector sequence from the chromosomal insertion results in either restoration of the wild-type target sequence or replacement of the wild type sequence with the deletion (null) allele. E. faecalis genes can be disrupted using a suicide vector that contains an internal fragment to a gene of interest. With the appropriate selection this plasmid will homologously recombine into the chromosome (Nallapareddy, S. R., X. Qin, G. M. Weinstock, M. Hook, B. E. Murray. 2000. Infect. Immun. 68:5218-5224, the disclosure of which is incorporated herein by reference).

[0583] The resultant population of Staphylococcus aureus, Enterococcus faecalis, or other Gram-positive colonies can then be evaluated to determine whether the target sequence is required for proliferation by PCR amplification of the affected target sequence. If the targeted gene is not required for proliferation, then PCR analysis will show that roughly equal numbers of colonies have retained either the wild-type or the mutant allele. If the targeted gene is required for proliferation, then only wild-type alleles will be recovered in the PCR analysis.

[0584] The method of cross-over PCR is used to generate the mutant allele by amplification of sequences flanking but not including the coding region of the gene of interest, such that overlap between the resulting two PCR amplification products allows them to hybridize. Further PCR amplification of this hybridization product using primers representing the extreme 5′ and 3′ ends can produce an amplification product containing an in-frame deletion of the coding region but retaining substantial flanking sequences.

[0585] For Staphylococcus aureus, this amplification product is subcloned into the suicide vector pSA3182 (Xia, M., et al. 1999 Plasmid 42:144-149, the disclosure of which is incorporated herein by reference in its entirety) which is host-dependent for autonomous replication. This vector includes a tetC tetracycline-resistance marker and the origin of replication of the well-known Staphylococcus aureus plasmid pT181 (Mojumdar, M and Kahn, S. A., Characterisation of the Tetracycline Resistance Gene of Plasmid pT181, J. Bacteriol. 170: 5522 (1988), the disclosure of which is incorporated herein by reference in its entirety). The vector lacks the repC gene which is required for autonomous replication of the vector at the pT181 origin. However autonomous replication of this vector can occur in a Staphylococcus aureus host strain such as SA3528, which expresses repC in trans. Once the amplified truncated target gene sequence is cloned and propagated in the pSA3182 vector, it can then be introduced into a repC minus strain such as RN4220 (Kreiswirth, B. N. et al., The Toxic Shock Syndrome Exotoxin Structural Gene is Not Detectably Transmitted by a Prophage, Nature 305:709-712 (1983), the disclosure of which is incorporated herein by reference in its entirety) by electroporation with selection for tetracycline resistance. In this strain, the vector must integrate by homologous recombination at the targeted gene in the chromosome to impart drug resistance. This results in a inserted truncated copy of the allele, followed by pSA3182 vector sequence, and finally an intact and functional allele of the targeted gene.

[0586] Once a tetR Staphylococcus aureus strain is isolated from the above technique and shown to include truncated and wild-type alleles of the targeted gene as described above, a second plasmid, pSA7592 (Xia, M., et al. 1999 Plasmid 42:144-149, the disclosure of which is incorporated herein by reference in its entirety) is introduced into the strain by electroporation. This gene includes an erythromycin resistance gene and a repC gene that is expressed at high levels. Expression of repC in these transformants is toxic due to interference of normal chromosomal replication at the integrated pT181 origin of replication. This counterselects for strains that have removed the vector sequence by homologous recombination, resulting in either of two outcomes: The counterselected cells either possess a wild-type allele of the targeted gene or a gene in which the wild-type allele has been replaced by the engineered in-frame deletion of the truncated allele.

[0587] PCR amplification can be used to test either outcome among the resulting erythromycin resistant, tet sensitive transformant colonies. If the targeted gene is not required for cellular replication, then PCR evidence for both wild-type and mutant alleles will be found among the population of resultant transformants. However, if the targeted gene is required for cellular replication, then only the wild-type form of the gene will be evident among the resulting transformants.

[0588] Similarly, for Enterococcus faecalis the PCR products containing the mutant allele of the target sequence may be introduced into an appropriate knockout vector and cells in which the wild type target has been disrupted are selected using the appropriate methodology.

[0589] The above methods have the advantage that insertion of an in-frame deletion mutation is far less likely to cause downstream polar effects on genes in the same operon as the targeted gene. However, it will be appreciated that other methods for disrupting Staphylococcus aureus or Enterococcus faecalis, which are familiar to those skilled in the art may also be used. Each gene in the operon may be disrupted using the methodology above to determine whether it is required for proliferation.

[0590] The example below describes approaches to express genes involved in cell proliferation.

EXAMPLE 16 Expression of the Proteins Encoded by Genes Identified as Required for Staphylococcus aureus or Enterococcus faecalis Proliferation

[0591] The following is provided as one exemplary method to express the proliferation-required proteins encoded by sequences identified by the methods described herein using expression systems designed either for E. coli, Staphylococcus aureus, or Enterococcus faecalis. First, the initiation and termination codons for the gene are identified. If desired, methods for improving translation or expression of the protein are well known in the art. For example, if the nucleic acid encoding the polypeptide to be expressed lacks a methionine codon to serve as the initiation site, a strong Shine-Dalgarno sequence, or a stop codon, these sequences can be added. Similarly, if the identified nucleic acid sequence lacks a transcription termination signal, this sequence can be added to the construct by, for example, splicing out such a sequence from an appropriate donor sequence. In addition, the coding sequence may be operably linked to a strong promoter or an inducible promoter, such as the fusion promoters described herein, if desired. The identified nucleic acid sequence or portion thereof encoding the polypeptide to be expressed is obtained by, for example, PCR from the bacterial expression vector or genome using oligonucleotide primers complementary to the identified nucleic acid sequence or portion thereof and containing restriction endonuclease sequences appropriate for inserting the coding sequences into the vector such that the coding sequences can be expressed from the vector's promoter. Alternatively, other conventional cloning techniques may be used to place the coding sequence under the control of the promoter. In some embodiments, a termination signal may be located downstream of the coding sequence such that transcription of the coding sequence ends at an appropriate position.

[0592] Several expression vector systems for protein expression in E. coli are well known and available to someone knowledgeable in the art. The coding sequence may be inserted into any of these vectors and placed under the control of the promoter. The expression vector may then be transformed into DH5α or some other E. coli strain suitable for the over expression of proteins.

[0593] Alternatively, an expression vector encoding a protein required for proliferation of Staphylococcus aureus or Enterococcus faecalis may be introduced into Staphylococcus aureus or Enterococcus faecalis. In some embodiments, gene encoding the protein required for proliferation may be operably linked to one of the fusion promoters described herein. Electroporation protocols for introducing nucleic acids into Staphylococcus aureus and Enterococcus faecalis are well known in the art. For example, the electroporation protocol described in Staphylococcus aureus transformation protocols described in J. C. Lee “Electroporation of Staphylococci” from Methods in Molecular Biology vol 47: Electroporation Protocols for Microorganisms Edited by: J. A. Nickoloff Humana Press Inc., Totowa, N.J. pp209-216, the disclosure of which is incorporated herein by reference in its entirety, may be used. Positive transformants are selected after growing the transformed cells on plates containing an antibiotic to which the vector confers resistance. For example, Staphylococcus aureus may be transformed with an expression vector in which the coding sequence is operably linked to the fusion promoters described herein, such as fusion promoters comprising the T5 promoter operably linked to a xyl operator such that expression of the encoded protein is inducible with xylose. Such an example vector is pXyl-T5 p15a. In another example, Enterococcus faecalis may be transformed with an expression vector in which the coding sequence is operably linked to the fusion promoters described herein, such as fusion promoters comprising the PL promoter operably linked to a xyl operator such that expression of the encoded protein is inducible with xylose. Such an example vector is pEPEF21.

[0594] In one embodiment, the protein is expressed and maintained in the cytoplasm as the native sequence. In an alternate embodiment, the expressed protein can be modified to include a protein tag that allows for differential cellular targeting, such as to the periplasmic space of Gram-negative or Gram-positive expression hosts or to the exterior of the cell (i.e., into the culture medium). In some embodiments, the osmotic shock cell lysis method described in Chapter 16 of Current Protocols in Molecular Biology, Vol. 2, (Ausubel, et al., Eds.) John Wiley & Sons, Inc. (1997) may be used to liberate the polypeptide from the cell. In still another embodiment, such a protein tag could also facilitate purification of the protein from either fractionated cells or from the culture medium by affinity chromatography. Each of these procedures can be used to express a proliferation-required protein.

[0595] Expressed proteins, whether in the culture medium or liberated from the periplasmic space or the cytoplasm, are then purified or enriched from the supernatant using conventional techniques such as ammonium sulfate precipitation, standard chromatography, immunoprecipitation, immunochromatography, size exclusion chromatography, ion exchange chromatography, and HPLC. Alternatively, the polypeptide may be secreted from the host cell in a sufficiently enriched or pure state in the supernatant or growth media of the host cell to permit it to be used for its intended purpose without further enrichment. The purity of the protein product obtained can be assessed using techniques such as SDS PAGE, which is a protein resolving technique well known to those skilled in the art. Coomassie, silver staining or staining with an antibody are typical methods used to visualize the protein of interest.

[0596] Antibodies capable of specifically recognizing the protein of interest can be generated using synthetic peptides using methods well known in the art. See, Antibodies: A Laboratory Manual, (Harlow and Lane, Eds.) Cold Spring Harbor Laboratory (1988). For example, 15-mer peptides having a sequence encoded by the appropriate identified gene sequence of interest or portion thereof can be chemically synthesized. The synthetic peptides are injected into mice to generate antibodies to the polypeptide encoded by the identified nucleic acid sequence of interest or portion thereof. Alternatively, samples of the protein expressed from the expression vectors discussed above can be purified and subjected to amino acid sequencing analysis to confirm the identity of the recombinantly expressed protein and subsequently used to raise antibodies.

[0597] The protein encoded by the identified nucleic acid sequence of interest or portion thereof can be purified using standard immunochromatography techniques. In such procedures, a solution containing the desired protein, such as the culture medium or a cell extract, is applied to a column having antibodies against the secreted protein attached to the chromatography matrix. The desired protein is allowed to bind the immunochromatography column. Thereafter, the column is washed to remove non-specifically bound proteins. The specifically-bound secreted protein is then released from the column and recovered using standard techniques. These procedures are well known in the art.

[0598] In an alternative protein purification scheme, the identified nucleic acid sequence of interest or portion thereof can be incorporated into expression vectors designed for use in purification schemes employing chimeric polypeptides. In such strategies the coding sequence of the identified nucleic acid sequence of interest or portion thereof is inserted in-frame with the gene encoding the other half of the chimera. The other half of the chimera can be maltose binding protein (MBP) or a nickel binding polypeptide encoding sequence. A chromatography matrix having antibody to MBP or nickel attached thereto is then used to purify the chimeric protein. Protease cleavage sites can be engineered between the MBP gene or the nickel binding polypeptide and the identified expected gene of interest, or portion thereof. Thus, the two polypeptides of the chimera can be separated from one another by protease digestion.

[0599] One useful expression vector for generating maltose binding protein fusion proteins is pMAL (New England Biolabs), which encodes the malE gene. In the pMal protein fusion system, the cloned gene is inserted into a pMal vector downstream from the malE gene. This results in the expression of an MBP-fusion protein. The fusion protein is purified by affinity chromatography. These techniques as described are well known to those skilled in the art of molecular biology. An Example describing in detail the generation of monoclonal and polyclonal antibodies appears below.

EXAMPLE 17 Production of an Antibody to an Isolated Staphylococcus aureus or Enterococcus faecalis Protein

[0600] Substantially pure protein or polypeptide is isolated from the transformed cells as described in Example 16. The concentration of protein in the final preparation is adjusted, for example, by concentration on a 10,000 molecular weight cut off AMICON filter device (Millipore, Bedford, Mass.), to the level of a few micrograms/ml. Monoclonal or polyclonal antibody to the protein can then be prepared as follows:

[0601] Monoclonal antibody to epitopes of any of the peptides identified and isolated as described can be prepared from murine hybridomas according to the classical method of Kohler, G. and Milstein, C., Nature 256:495 (1975) or any of the well-known derivative methods thereof. Briefly, a mouse is repetitively inoculated with a few micrograms of the selected protein or peptides derived therefrom over a period of a few weeks. The mouse is then sacrificed, and the antibody-producing cells of the spleen isolated. The spleen cells are fused by means of polyethylene glycol with mouse myeloma cells, and the excess unfused cells destroyed by growth of the system on selective media comprising aminopterin (HAT media). The successfully-fused cells are diluted and aliquots of the dilution placed in wells of a microtiter plate where growth of the culture is continued. Antibody-producing clones are identified by detection of antibody in the supernatant fluid of the wells by immunoassay procedures, such as ELISA, as described by Engvall, E., “Enzyme immunoassay ELISA and EMIT,” Meth. Enzymol. 70:419 (1980), and derivative methods thereof. Selected positive clones can be expanded and their monoclonal antibody product harvested for use. Detailed procedures for monoclonal antibody production are described in Davis, L. et al. Basic Methods in Molecular Biology Elsevier, New York. Section 21-2.

[0602] Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single protein or a peptide can be prepared by immunizing suitable animals with the expressed protein or peptides derived therefrom described above, which can be unmodified or modified to enhance immunogenicity. Effective polyclonal antibody production is affected by many factors related both to the antigen and the host species. For example, small molecules tend to be less immunogenic than larger molecules and can require the use of carriers and adjuvant. Also, host animals vary in response to site of inoculations and dose, with both inadequate or excessive doses of antigen resulting in low titer antisera. Small doses (ng level) of antigen administered at multiple intradermal sites appears to be most reliable. An effective immunization protocol for rabbits can be found in Vaitukaitis, J. et al. J. Clin. Endocrinol. Metab. 33:988-991 (1971).

[0603] Booster injections can be given at regular intervals, and antiserum harvested when antibody titer thereof, as determined semi-quantitatively, for example, by double immunodiffusion in agar against known concentrations of the antigen, begins to fall. See, for example, Ouchterlony, O. et al., Chap. 19 in: Handbook of Experimental Immunology D. Wier (ed) Blackwell (1973). Plateau concentration of antibody is usually in the range of 0.1 to 0.2 mg/ml of serum (about 12:M). Affinity of the antisera for the antigen is determined by preparing competitive binding curves, as described, for example, by Fisher, D., Chap. 42 in: Manual of Clinical Immunology, 2d Ed. (Rose and Friedman, Eds.) Amer. Soc. For Microbiol., Washington, D.C. (1980).

[0604] Antibody preparations prepared according to either protocol are useful in quantitative immunoassays which determine concentrations of antigen-bearing substances in biological samples; they are also used semi-quantitatively or qualitatively to identify the presence of antigen in a biological sample. The antibodies can also be used in therapeutic compositions for killing bacterial cells expressing the protein. The next section describes how to make and use an in vivo system for performing in vivo analysis on bacterial strains containing regulatable essential genes and for screening compounds that inhibit bacterial proliferation.

[0605] The example below describes the an in vivo system for analyzing the growth characteristics of S. aureus and Enterococcus faecalis that contain regulatable essential genes.

EXAMPLE 18 In vivo Models for Analyzing the Regulation of Essential Genes and Screening Therapeutic Compounds

[0606] Whether a given gene is essential for a bacterium growing in nature (in vivo) or whether a gene is required for infection of a host organism can be determined by constructing a conditional allele of that gene which is suitable for animal testing. Expression of an antisense RNA molecule complementary to a candidate gene can, by decreasing synthesis of the candidate gene product, constitute a conditional allele that is suitable for animal testing. Candidate genes essential in vivo or required for virulence can be tested by preparing bacterial strains using the EGI, antisense, promoter replacement, operator insertion, and random promoter insertion techniques. Preferably, the constructs used to prepare these bacterial strains contain an expression vector under control of a fusion promoter described herein. Animals infected by these bacterial strains can be used to evaluate the ability to modulate bacterial proliferation in vivo by adding or decreasing the concentration of inducer and can be used to screen compounds in vivo to validate their therapeutic properties.

[0607] Stable transformation of Staphylococcus aureus or Enterococcus faecalis with constructs prepared as described herein can be accomplished by electroporation and selection for clones resistant to 15 μg/ml chloramphenicol in the case of Staphylococcus aureus of 20 μg/ml erythromycin in the case of Enterococcus faecalis. Preferably, whether each construct is essential is verified in vitro using induction in medium containing various amounts of inducer or an analog thereof, prior to introducing the cells into animals.

[0608] For the thigh infection model, animals are rendered neutropenic chemically prior to the start of the experiment. By one approach, the inoculum will consist of 10⁵ to 10⁶ cells of a normally virulent Gram-positive organism expressing antisense to an essential gene such as that for gyrB under control of a fusion promoter described herein. These cells are injected to one thigh of a suitable animal (e.g., mice). Most microorganisms attain a logarithmic growth 2 hours after inoculation. Treated animals receive therapy from 2 hours post inoculation up to 24 hours. A typical efficacy study consists of 2 control groups and 5 treatment groups, each being treated with different doses of inducer. With 15 animals per group, 105 mice are used per study. The control sets are given intraperitoneal injections of saline, which will not induce expression of the fusion promoter. The experimental animals are given the inducer (e.g. xylose, tetracycline, IPTG, maltose, absence of tryptophan, or temperature change) in intraperitoneal injections to induce expression of the antisense promoter. Alternatively, the promoter can be induced by intravenous infusion of inducer at sub-toxic levels.

[0609] The end point used to follow the infection process is viable bacterial counts per thigh. The controls in which the antisense RNA is not induced will fail to impede growth of the Gram-positive organism and thus a logarithmic increase in viable bacteria will occur. The Gram-positive cells recovered from the site of infection should be viable until antisense expression is subsequently induced. This will demonstrate that the plasmid is still patent. However, in animals receiving the xylose injections, expression of the antisense RNA will occur, the essential gene or gene product will be compromised, and the Gram-positive cells infecting the mice will not multiply. Accordingly, fewer viable cells will be recovered from the site of infection in the experimental animals. The Gram-positive cells from the induced mice will be recovered, if still present, and assayed as above to determine if the promoter and gene are still present and functional.

[0610] The sections below describe the use of fusion promoters in methods to discover compounds that inhibit bacterial proliferation.

EXAMPLE 19 Screening Chemical Libraries in Cells Sensitized with Antisense RNA Transcribed from Fusion Promoters

[0611] Having isolated and expressed bacterial proteins shown to be required for bacterial proliferation, the present invention further contemplates the use of these expressed target proteins in assays to screen libraries of compounds for potential drug candidates. The generation of chemical libraries is well known in the art. For example, combinatorial chemistry can be used to generate a library of compounds to be screened in the assays described herein. A combinatorial chemical library is a collection of diverse chemical compounds generated by either chemical synthesis or biological synthesis by combining a number of chemical “building block” reagents. For example, a linear combinatorial chemical library such as a polypeptide library is formed by combining amino acids in every possible combination to yield peptides of a given length. Millions of chemical compounds theoretically can be synthesized through such combinatorial mixings of chemical building blocks. For example, one commentator observed that the systematic, combinatorial mixing of 100 interchangeable chemical building blocks results in the theoretical synthesis of 100 million tetrameric compounds or 10 billion pentameric compounds. (Gallop et al., 1994, “Applications of Combinatorial Technologies to Drug Discovery, Background and Peptide Combinatorial Libraries” Journal of Medicinal Chemistry, 37: 1233-1250). Other chemical libraries known to those in the art may also be used, including natural product libraries.

[0612] Once generated, combinatorial libraries can be screened for compounds that possess desirable biological properties. For example, compounds which may be useful as drugs or to develop drugs would likely have the ability to bind to the target protein identified, expressed and purified as discussed above. Further, if the identified target protein is an enzyme, candidate compounds would likely interfere with the enzymatic properties of the target protein. For example, the enzymatic function of a target protein may be to serve as a protease, nuclease, phosphatase, dehydrogenase, transporter protein, transcriptional enzyme, and any other type of enzyme known or unknown. Thus, the present invention contemplates using the protein products described above to screen combinatorial chemical libraries.

[0613] In one example, the target protein is a serine protease and the substrate of the enzyme is known. The present example is directed towards the analysis of libraries of compounds to identify compounds that function as inhibitors of the target enzyme. First, a library of small molecules is generated using methods of combinatorial library formation well known in the art. U.S. Pat. Nos. 5,463,564 and 5,574,656, to Agrafiotis, et al., entitled “System and Method of Automatically Generating Chemical Compounds with Desired Properties,” the disclosures of which are incorporated herein by reference in their entireties, are two such teachings. Then the library compounds are screened to identify those compounds that possess desired structural and functional properties. U.S. Pat. No. 5,684,711, the disclosure of which is incorporated herein by reference in its entirety, also discusses a method for screening libraries.

[0614] To illustrate the screening process, the target polypeptide and chemical compounds of the library are combined with one another and permitted to interact with one another. A labeled substrate is added to the incubation. The label on the substrate is such that a detectable signal is emitted from metabolized substrate molecules. The emission of this signal permits one to measure the effect of the combinatorial library compounds on the enzymatic activity of target enzymes by comparint it to the signal emitted in the absence of combinatorial library compounds. The characteristics of each library compound are encoded so that compounds demonstrating activity against the enzyme can be analyzed and features common to the various compounds identified can be isolated and combined into future iterations of libraries.

[0615] Once a library of compounds is screened, subsequent libraries are generated using those chemical building blocks that possess the features shown in the first round of screen to have activity against the target enzyme. Using this method, subsequent iterations of candidate compounds will possess more and more of those structural and functional features required to inhibit the function of the target enzyme, until a group of enzyme inhibitors with high specificity for the enzyme can be found. These compounds can then be further tested for their safety and efficacy as antibiotics for use in mammals.

[0616] It will be readily appreciated that this particular screening methodology is exemplary only. Other methods are well known to those skilled in the art. For example, a wide variety of screening techniques are known for a large number of naturally-occurring targets when the biochemical function of the target protein is known. For example, some techniques involve the generation and use of small peptides to probe and analyze target proteins both biochemically and genetically in order to identify and develop drug leads. Such techniques include the methods described in PCT publications No. WO9935494, WO9819162, WO9954728, the disclosures of which are incorporated herein by reference in their entireties.

[0617] Current cell-based assays used to identify or to characterize compounds for drug discovery and development frequently depend on detecting the ability of a test compound to modulate the activity of a target molecule located within a cell or located on the surface of a cell. Most often such target molecules are proteins such as enzymes, receptors and the like. However, target molecules may also include other molecules such as DNAs, lipids, carbohydrates and RNAs including messenger RNAs, ribosomal RNAs, tRNAs and the like. A number of highly sensitive cell-based assay methods are available to those of skill in the art to detect binding and interaction of test compounds with specific target molecules. However, these methods are generally not highly effective when the test compound binds to or otherwise interacts with its target molecule with moderate or low affinity. In addition, the target molecule may not be readily accessible to a test compound in solution, such as when the target molecule is located inside the cell or within a cellular compartment such as the periplasm of a bacterial cell. Thus, current cell-based assay methods are limited in that they are not effective in identifying or characterizing compounds that interact with their targets with moderate to low affinity or compounds that interact with targets that are not readily accessible.

[0618] The cell-based assay methods of the present invention have substantial advantages over current cell-based assays. These advantages derive from the use of sensitized cells in which the level or activity of at least one proliferation-required gene product (the target molecule) has been specifically reduced to the point where the presence or absence of its function becomes a rate-determining step for cellular proliferation. In the cell-based assays of the present invention, the level or activity of a proliferation-required gene product is reduced by transcribing an antisense nucleic acid complementary to at least a portion of the nucleic acid encoding the gene product from one of the fusion promoters described herein. Such sensitized cells become much more sensitive to compounds that are active against the affected target molecule. Thus, cell-based assays of the present invention are capable of detecting compounds exhibiting low or moderate potency against the target molecule of interest because such compounds are substantially more potent on sensitized cells than on non-sensitized cells. The effect may be such that a test compound may be two to several times more potent, at least 10 times more potent, at least 20 times more potent, at least 50 times more potent, at least 100 times more potent, at least 1000 times more potent, or even more than 1000 times more potent when tested on the sensitized cells as compared to the non-sensitized cells.

[0619] Due in part to the increased appearance of antibiotic resistance in pathogenic microorganisms and to the significant side-effects associated with some currently used antibiotics, novel antibiotics acting at new targets are highly sought after in the art. Yet, another limitation in the current art related to cell-based assays is the problem of repeatedly identifying hits against the same kinds of target molecules in the same limited set of biological pathways. This may occur when compounds acting at such new targets are discarded, ignored or fail to be detected because compounds acting at the “old” targets are encountered more frequently and are more potent than compounds acting at the new targets. As a result, the majority of antibiotics in use currently interact with a relatively small number of target molecules within an even more limited set of biological pathways.

[0620] The use of sensitized cells comprising the fusion promoters of the current invention provides a solution to the above problem in two ways. First, desired compounds acting at a target of interest, whether a new target or a previously known but poorly exploited target, can now be detected above the “noise” of compounds acting at the “old” targets due to the specific and substantial increase in potency of such desired compounds when tested on the sensitized cells of the current invention. Second, the methods used to sensitize cells to compounds acting at a target of interest may also sensitize these cells to compounds acting at other target molecules within the same biological pathway. For example, expression of an antisense molecule to a gene encoding a ribosomal protein is expected to sensitize the cell to compounds acting at that ribosomal protein and may also sensitize the cells to compounds acting at any of the ribosomal components (proteins or rRNA) or even to compounds acting at any target which is part of the protein synthesis pathway. Thus an important advantage of the present invention is the ability to reveal new targets and pathways that were previously not readily accessible to drug discovery methods.

[0621] Sensitized cells of the present invention are prepared by reducing the activity or level of a target molecule. The target molecule may be a gene product, such as an RNA or polypeptide produced from the proliferation-required nucleic acids described herein. Alternatively, the target may be a gene product such as an RNA or polypeptide which is produced from a sequence within the same operon as the proliferation-required nucleic acids described herein. In addition, the target may be an RNA or polypeptide in the same biological pathway as the proliferation-required nucleic acids described herein. Such biological pathways include, but are not limited to, enzymatic, biochemical and metabolic pathways as well as pathways involved in the production of cellular structures such the cell wall.

[0622] Current methods employed in the arts of medicinal and combinatorial chemistries are able to make use of structure-activity relationship information derived from testing compounds in various biological assays including direct binding assays and cell-based assays. Occasionally compounds are directly identified in such assays that are sufficiently potent to be developed as drugs. More often, initial hit compounds exhibit moderate or low potency. Once a hit compound is identified with low or moderate potency, directed libraries of compounds are synthesized and tested in order to identify more potent leads. Generally these directed libraries are combinatorial chemical libraries consisting of compounds with structures related to the hit compound but containing systematic variations including additions, subtractions and substitutions of various structural features. When tested for activity against the target molecule, structural features are identified that either alone or in combination with other features enhance or reduce activity. This information is used to design subsequent directed libraries containing compounds with enhanced activity against the target molecule. After one or several iterations of this process, compounds with substantially increased activity against the target molecule are identified and may be further developed as drugs. This process is facilitated by use of the sensitized cells of the present invention since compounds acting at the selected targets exhibit increased potency in such cell-based assays, thus; more compounds can now be characterized providing more useful information than would be obtained otherwise.

[0623] Thus, it is now possible using cell-based assays of the present invention to identify or characterize compounds that previously would not have been readily identified or characterized including compounds that act at targets that previously were not readily exploited using cell-based assays. The process of evolving potent drug leads from initial hit compounds is also substantially improved by the cell-based assays of the present invention because, for the same number of test compounds, more structure-function relationship information is likely to be revealed.

[0624] The method of sensitizing a cell entails selecting a suitable gene or operon. A suitable gene or operon is one whose expression is required for the proliferation of the cell to be sensitized. The next step is to introduce into the cells to be sensitized, an antisense RNA capable of hybridizing to the suitable gene or operon or to the RNA encoded by the suitable gene or operon. Introduction of the antisense RNA can be in the form of an expression vector in which antisense RNA is produced under the control of a fusion promoter as described herein. The amount of antisense RNA produced is limited by varying the inducer concentration to which the cell is exposed and thereby varying the activity of the promoter driving transcription of the antisense RNA. Thus, cells are sensitized by exposing them to an inducer concentration that results in a sub-lethal level of antisense RNA expression.

[0625] In one embodiment of the cell-based assays, an antisense nucleic acid that is complementary to a proliferation-required gene from a Gram-positive organism is used to inhibit the production of a proliferation-required protein. Expression vectors which contain an antisense nucleic acid complementary to identified genes required for proliferation operably linked to a fusion promoter of the present invention are used to limit the concentration of a proliferation-required protein without severely inhibiting growth. To achieve that goal, a growth inhibition dose curve of inducer is calculated by plotting various doses of inducer against the corresponding growth inhibition caused by the antisense expression. From this curve, various percentages of antisense induced growth inhibition, from 1 to 100% can be determined. If the fusion promoter contained in the expression vector contains a xylO linked to one of the promoters of SEQ ID NO.: 36-45, transcription in Staphylococcus aureus and Enterococcus faecalis will be regulatable by the xyl repressor and expression from the promoter can be induced with xylose. Similarly, IPTG, xylose, tetracycline, maltose, absence of trp and temperature inducible promoters may be used. For example, the highest concentration of the inducer that does not reduce the growth rate significantly can be estimated from the curve. Cellular proliferation can be monitored by growth medium turbidity via OD measurements. In another example, the concentration of inducer that reduces growth by 25% can be predicted from the curve. In still another example, a concentration of inducer that reduces growth by 50% can be calculated. Additional parameters such as colony forming units (cfu) can be used to measure cellular viability.

[0626] Cells to be assayed are exposed to the above-determined concentrations of inducer. The presence of the inducer at this sub-lethal concentration reduces the amount of the proliferation required gene product to the lowest amount in the cell that will support growth. Cells grown in the presence of this concentration of inducer are therefore specifically more sensitive to inhibitors of the proliferation-required protein or RNA of interest or to inhibitors of proteins or RNAs in the same biological pathway as the proliferation-required protein or RNA of interest but not to inhibitors of unrelated proteins or RNAs.

[0627] Cells pretreated with sub-inhibitory concentrations of inducer and thus containing a reduced amount of proliferation-required target gene product are then used to screen for compounds that reduce cell growth. The sub-lethal concentration of inducer may be any concentration consistent with the intended use of the assay to identify candidate compounds to which the cells are more sensitive. For example, the sub-lethal concentration of the inducer may be such that growth inhibition is at least about 5%, at least about 8%, at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60% at least about 75%, or more. Cells which are pre-sensitized using the preceding method are more sensitive to inhibitors of the target protein because these cells contain less target protein to inhibit than wild-type cells.

[0628] It will be appreciated that the cell-based assays described above may be implemented in Staphylococcus aureus and Enterococcus faecalis using proliferation-inhibiting antisense sequences under the control of a fusion promoter described herein. It will also be appreciated that the above cell-based assays employing the fusion promoters described herein can be implemented in other Gram-positive organims, including but not limited to, Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species.

[0629] In another embodiment of the cell-based assays of the present invention, the level or activity of a proliferation required gene product is reduced using a mutation, such as a temperature sensitive mutation, in the proliferation-required sequence and an antisense nucleic acid complementary to the proliferation-required sequence which is transcribed from one of the fusion promoters described herein. Growing the cells at an intermediate temperature between the permissive and restrictive temperatures of the temperature sensitive mutant where the mutation is in a proliferation-required gene produces cells with reduced activity of the proliferation-required gene product. The antisense RNA complementary to the proliferation-required sequence further reduces the activity of the proliferation required gene product. Drugs that may not have been found using either the temperature sensitive mutation or the antisense nucleic acid alone may be identified by determining whether cells in which expression of the antisense nucleic acid has been induced and which are grown at a temperature between the permissive temperature and the restrictive temperature are substantially more sensitive to a test compound than cells in which expression of the antisense nucleic acid has not been induced and which are grown at a permissive temperature. Also drugs found previously from either the antisense nucleic acid alone or the temperature sensitive mutation alone may have a different sensitivity profile when used in cells combining the two approaches, and that sensitivity profile may indicate a more specific action of the drug in inhibiting one or more activities of the gene product.

[0630] Temperature sensitive mutations may be located at different sites within the gene and correspond to different domains of the protein. For example, the dnaB gene of Escherichia coli encodes the replication fork DNA helicase. DnaB has several domains, including domains for oligomerization, ATP hydrolysis, DNA binding, interaction with primase, interaction with DnaC, and interaction with DnaA [(Biswas, E. E. and Biswas, S. B. 1999, “Mechanism and DnaB helicase of Escherichia coli: structural domains involved in ATP hydrolysis, DNA binding, and oligomerization” Biochem. 38: 10919-10928; Hiasa, H. and Marians, K. J. 1999, “Initiation of bidirectional replication at the chromosomal origin is directed by the interaction between helicase and primase” J. Biol. Chem. 274: 27244-27248; San Martin, C., et al. 1998, “Three-dimensional reconstructions from cryoelectron microscopy images reveal an intimate complex between helicase DnaB and its loading partner DnaC” Structure 6:501-9; Sutton, M. D. et al. 1998. “Escherichia coli DnaA protein. The N-terminal domain and loading of DnaB helicase at the E. coli chromosomal origin” J. Biol. Chem. 273: 34255-62.), the disclosures of which are incorporated herein by reference in their entireties]. Temperature sensitive mutations in different domains of DnaB confer different phenotypes at the restrictive temperature, which include either an abrupt stop or slow stop in DNA replication with or without DNA breakdown (Wechsler, J. A. and Gross, J. D. 1971. “Escherichia coli mutants temperature-sensitive for DNA synthesis” Mol. Gen. Genetics 113: 273-284, the disclosure of which is incorporated herein by reference in its entirety) and termination of growth or cell death. Combining the use of temperature sensitive mutations in the dnaB gene that cause cell death at the restrictive temperature with an antisense to the dnaB gene could lead to the discovery of very specific and effective inhibitors of one or a subset of activities exhibited by DnaB.

[0631] It will be appreciated that the above method may be performed with any mutation which reduces but does not eliminate the activity or level of the gene product which is required for proliferation.

[0632] When screening for antimicrobial agents against a gene product required for proliferation, growth inhibition of cells containing a limiting amount of that proliferation-required gene product can be assayed. Growth inhibition can be measured by directly comparing the amount of growth, measured by the optical density of the growth medium, between an experimental sample and a control sample. Alternative methods for assaying cell proliferation include measuring the activity of any of a variety of reporter genes, various enzymatic activity assays, and other methods well known in the art.

[0633] It will be appreciated that the above method may be performed in solid phase, liquid phase or a combination of the two. For example, cells grown on nutrient agar containing the inducer of the antisense construct may be exposed to compounds spotted onto the agar surface. A compound's effect may be judged from the diameter of the resulting killing zone, the area around the compound application point in which cells do not grow. Multiple compounds may be transferred to agar plates and simultaneously tested using automated and semi-automated equipment including but not restricted to multi-channel pipettes (for example the Beckman Multimek) and multi-channel spotters (for example the Genomic Solutions Flexys). In this way multiple plates and thousands to millions of compounds may be tested per day.

[0634] The compounds may also be tested entirely in liquid phase using microtiter plates as described below. Liquid phase screening may be performed in microtiter plates containing 96, 384, 1536 or more wells per microtiter plate to screen multiple plates and thousands to millions of compounds per day. Automated and semi-automated equipment may be used for addition of reagents (for example cells and compounds) and determination of cell density.

[0635] The examples below describe cell-based assays that exploit the fusion promoters described herein to sensitize cells and allow for the discovery of a diverse class of antibiotics. Example 20 describes experiments performed in E. coli, which verify the effectiveness of the cell-based approach described above.

EXAMPLE 20 Cell-Based Assay Using Antisense RNA Complementary to Genes Encoding Ribosomal Proteins

[0636] The effectiveness of the above cell-based assay was validated using constructs expressing antisense RNA to the proliferation required E. coli genes rplL, rplJ, and rplW encoding ribosomal proteins L7/L12, L10 and L23 respectively. These proteins are part of the protein synthesis apparatus of the cell and as such are required for proliferation. These constructs were used to test the effect of antisense expression on cell sensitivity to antibiotics known to bind to the ribosome and thereby inhibit protein synthesis. Constructs expressing antisense RNA to several other genes (elaD, visC, yohH, and atpE/B), the products of which are not involved in protein synthesis were used for comparison.

[0637] First, pLex5BA (Krause et al. 1997. J. Mol. Biol. 274: 365, the disclosure of which is incorporated herein by reference in its entirety) expression vectors containing antisense constructs to either rplW or to elaD were introduced into separate E. coli cell populations. Vector introduction is a technique well known to those of ordinary skill in the art. The expression vectors of this example contain IPTG inducible promoters that drive the expression of the antisense RNA in the presence of the inducer. However, those skilled in the art will appreciate that other inducible promoters may also be used. Suitable expression vectors are also well known in the art. The E. coli antisense clones to genes encoding ribosomal proteins L7/L12, L10 and L23 were used to test the effect of antisense expression on cell sensitivity to the antibiotics known to bind to these proteins. Expression vectors containing antisense to either the genes encoding L7/L12 and L10 or L23 were introduced into separate E. coli cell populations.

[0638] The cell populations were exposed to a range of IPTG concentrations in liquid medium to obtain the growth inhibitory dose curve for each clone. First, seed cultures were grown to a particular turbidity that is measured by the optical density (OD) of the growth solution. The OD of the solution is directly related to the number of bacterial cells contained therein. Subsequently, sixteen 200 ul liquid medium cultures were grown in a 96 well microtiter plate at 37° C. with a range of IPTG concentrations in duplicate two-fold serial dilutions from 1600 uM to 12.5 uM (final concentration). Additionally, control cells were grown in duplicate without IPTG. These cultures were started from equal amounts of cells derived from the same initial seed culture of a clone of interest. The cells were grown for up to 15 hours and the extent of growth was determined by measuring the optical density of the cultures at 600 nm. When the control culture reached mid-log phase the percent growth (relative to the control culture) for each of the IPTG containing cultures was plotted against the log concentrations of IPTG to produce a growth inhibitory dose response curve for the IPTG. The concentration of IPTG that inhibits cell growth to 50% (IC₅₀) as compared to the 0 mM IPTG control (0% growth inhibition) was then calculated from the curve. Under these conditions, an amount of antisense RNA was produced that reduced the expression levels of rplW and elaD to a degree such that growth was inhibited by 50%.

[0639] Alternative methods of measuring growth are also contemplated. Examples of these methods include measurements of proteins, the expression of which is engineered into the cells being tested and can readily be measured. Examples of such proteins include green fluorescent protein (GFP) and various enzymes.

[0640] Cells were pretreated with the selected concentration of IPTG and then used to test the sensitivity of cell populations to tetracycline, erythromycin and other protein synthesis inhibitors. FIG. 12 is an IPTG dose response curve in E. coli transformed with an IPTG-inducible plasmid containing either an antisense clone to the E. coli rplW gene (AS-rplW) which encodes ribosomal protein L23 which is required for protein synthesis and essential for cell proliferation, or an antisense clone to the elaD (AS-elaD) gene which is not known to be involved in protein synthesis and which is also essential for proliferation.

[0641] An example of a tetracycline dose response curve is shown in FIGS. 13A and 13B for the rplW and elaD genes, respectively. Cells were grown to log phase and then diluted into media alone or media containing IPTG at concentrations which give 20% and 50% growth inhibition as determined by IPTG dose response curves. After 2.5 hours, the cells were diluted to a final OD₆₀₀ of 0.002 into 96 well plates containing (1) +/−IPTG at the same concentrations used for the 2.5 hour pre-incubation; and (2) serial two-fold dilutions of tetracycline such that the final concentrations of tetracycline range from 1 μg/ml to 15.6 ng/ml and 0 μg/ml. The 96 well plates were incubated at 37° C. and the OD₆₀₀ was read by a plate reader every 5 minutes for up to 15 hours. For each IPTG concentration and the no IPTG control, tetracycline dose response curves were determined when the control (absence of tetracycline) reached 0.1 OD₆₀₀. To compare tetracycline sensitivity with and without IPTG, tetracycline IC_(50s) were determined from the dose response curves (FIGS. 13A and 13B). Cells with reduced levels of L23 (AS-rplW) showed increased sensitivity to tetracycline (FIG. 13A) as compared to cells with reduced levels of the elaD gene product (AS-elaD) (FIG. 13B). FIG. 14 shows a summary bar chart in which the ratios of tetracycline IC_(50s) determined in the presence of IPTG which gives 50% growth inhibition versus tetracycline IC_(50s) determined without IPTG (fold increase in tetracycline sensitivity) were plotted. Cells with reduced levels of either L7/L12 (encoded by genes rplL, rplJ) or L23 (encoded by the rplW gene) showed increased sensitivity to tetracycline (FIG. 14). Cells expressing antisense to genes not known to be involved in protein synthesis (AS-atpB/E, AS-visC, AS-elaD, AS-yohH) did not show the same increased sensitivity to tetracycline, validating the specificity of this assay (FIG. 14).

[0642] In addition to the above, it has been observed in initial experiments that clones expressing antisense RNA to genes involved in protein synthesis (including genes encoding ribosomal proteins L7/L12 & L10, L7/L12 alone, L22, and L18, as well as genes encoding rRNA and Elongation Factor G) have increased sensitivity to the macrolide, erythromycin, whereas clones expressing antisense to the non-protein synthesis genes elaD, atpB/E and visC do not. Furthermore, the clone expressing antisense to rplL and rplJ does not show increased sensitivity to nalidixic acid and ofloxacin, antibiotics which do not inhibit protein synthesis.

[0643] The results with the ribosomal protein genes rplL, rplJ, and rplW as well as the initial results using various other antisense clones and antibiotics show that limiting the concentration of an antibiotic target makes cells more sensitive to the antimicrobial agents that specifically interact with that protein. The results also show that these cells are sensitized to antimicrobial agents that inhibit the overall function in which the protein target is involved but are not sensitized to antimicrobial agents that inhibit other functions.

[0644] It will be appreciated that the cell-based assays described above may be implemented in Staphylococcus aureus and Enterococcus faecalis using proliferation-inhibiting antisense sequences under the control of a fusion promoter described herein. It will also be appreciated that the above cell-based assays employing the fusion promoters described herein can be implemented in other Gram-positive organims, including but not limited to, Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species.

[0645] The cell-based assay described above may also be used to identify the biological pathway in which a proliferation-required nucleic acid or its gene product lies. In such methods, cells transcribing a sub-lethal level of antisense to a target proliferation-required nucleic acid from the fusion promoters described herein and control cells in which expression of the antisense has not been induced are contacted with a panel of antibiotics known to act in various pathways. If the antibiotic acts in the pathway in which the target proliferation-required nucleic acid or its gene product lies, cells in which expression of the antisense has been induced will be more sensitive to the antibiotic than cells in which expression of the antisense has not been induced.

[0646] As a control, the results of the assay may be confirmed by contacting a panel of cells expressing antisense nucleic acids to many different proliferation-required genes including the target proliferation-required gene. If the antibiotic is acting specifically, heightened sensitivity to the antibiotic will be observed only in the cells expressing antisense to a target proliferation-required gene (or cells expressing antisense to other proliferation-required genes in the same pathway as the target proliferation-required gene) but will not be observed generally in all cells expressing antisense to proliferation-required genes.

[0647] It will be appreciated that the cell-based assays described above may be implemented in Staphylococcus aureus and Enterococcus faecalis using proliferation-inhibiting antisense sequences under the control of a fusion promoter described herein. It will also be appreciated that the above cell-based assays employing the fusion promoters described herein can be implemented in other Gram-positive organims, including but not limited to, Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species.

[0648] Similarly, the above method may be used to determine the pathway on which a test compound, such as a test antibiotic acts. A panel of cells, each of which transcribes antisense to a proliferation-required nucleic acid in a known pathway from one of the fusion promoters described herein, is contacted with a compound for which it is desired to determine the pathway on which it acts. The sensitivity of the panel of cells to the test compound is determined in cells in which expression of the antisense has been induced and in control cells in which expression of the antisense has not been induced. If the test compound acts on the pathway on which an antisense nucleic acid acts, cells in which expression of the antisense has been induced will be more sensitive to the compound than cells in which expression of the antisense has not been induced. In addition, control cells in which expression of antisense to proliferation-required genes in other pathways has been induced will not exhibit heightened sensitivity to the compound. In this way, the pathway on which the test compound acts may be determined.

[0649] It will be appreciated that the cell-based assays described above may be implemented in Staphylococcus aureus and Enterococcus faecalis using proliferation-inhibiting antisense sequences under the control of a fusion promoter described herein. It will also be appreciated that the above cell-based assays employing the fusion promoters described herein can be implemented in other Gram-positive organims, including but not limited to, Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species.

[0650] The Example below provides one method for performing such assays.

EXAMPLE 21 Identification of the Pathway in Which a Proliferation-Required Gene Lies or the Pathway on which an Antibiotic Acts

[0651] A. Preparation of Bacterial Stocks for Assay

[0652] To provide a consistent source of cells to screen, frozen stocks of host bacteria containing the desired antisense construct are prepared using standard microbiological techniques. For example, a single clone of the microorganism can be isolated by streaking out a sample of the original stock onto an agar plate containing nutrients for cell growth and an antibiotic for which the antisense construct contains a gene which confers resistance. After overnight growth an isolated colony is picked from the plate with a sterile needle and transferred to an appropriate liquid growth media containing the antibiotic required for maintenance of the plasmid. The cells are incubated at 30° C. to 37° C. with vigorous shaking for 4 to 6 hours to yield a culture in exponential growth. Sterile glycerol is added to 15% (volume to volume) and 100 μL to 500 μL aliquots are distributed into sterile cryotubes, snap frozen in liquid nitrogen, and stored at −80° C. for future assays.

[0653] B. Growth of Bacteria for Use in the Assay

[0654] A day prior to an assay, a stock vial is removed from the freezer, rapidly thawed (37° C. water bath) and a loop of culture is streaked out on an agar plate containing nutrients for cell growth and an antibiotic to which the antisense construct confers resistance. After overnight growth at 37° C., ten randomly chosen, isolated colonies are transferred from the plate (sterile inoculum loop) to a sterile tube containing 5 mL of appropriate medium containing the antibiotic to which the antisense vector confers resistance. After vigorous mixing to form a homogeneous cell suspension, the optical density of the suspension is measured at 600 nm (OD₆₀₀) and if necessary an aliquot of the suspension is diluted into a second tube of 5 mL, sterile, appropriate medium plus antibiotic to achieve an OD₆₀₀≦0.02 absorbance units. The culture is then incubated at 37° C. for 1-2 hrs with shaking until the OD₆₀₀ reaches OD 0.2-0.3. At this point the cells are ready to be used in the assay.

[0655] C. Selection of Media to be Used in Assay

[0656] Two-fold dilution series of the inducer are generated in culture media containing the appropriate antibiotic for maintenance of the antisense construct. Several media are tested side by side and three to four wells are used to evaluate the effects of the inducer at each concentration in each media. For example, LB broth, TBD broth, Muller-Hinton, or other media may be tested with the inducer xylose at the following concentrations, 5 mM, 10 mM, 20 mM, 40 mM, 80 mM, 120 mM and 160 mM. Equal volumes of test media-inducer and cells are added to the wells of a 384 well microtiter plate and mixed. The cells are prepared as described above and diluted 1:100 in the appropriate media containing the test antibiotic immediately prior to addition to the microtiter plate wells. For a control, cells are also added to several wells of each media that do not contain inducer, for example 0 mM xylose. Cell growth is monitored continuously by incubation at 37° C. in a microtiter plate reader monitoring the OD₆₀₀ of the wells over an 18-hour period. The percent inhibition of growth produced by each concentration of inducer is calculated by comparing the rates of logarithmic growth against that exhibited by cells growing in media without inducer. The medium yielding greatest sensitivity to inducer is selected for use in the assays described below.

[0657] D. Measurement of Test Antibiotic Sensitivity in the Absence of Antisense Construct Induction

[0658] Two-fold dilution series of antibiotics of known mechanism of action are generated in the culture media selected for further assay development that has been supplemented with the antibiotic used to maintain the construct. A panel of test antibiotics known to act on different pathways is tested side by side with three to four wells being used to evaluate the effect of a test antibiotic on cell growth at each concentration. Equal volumes of test antibiotic and cells are added to the wells of a 384 well microtiter plate and mixed. Cells are prepared as described above using the media selected for assay development supplemented with the antibiotic required to maintain the antisense construct and are diluted 1:100 in identical media immediately prior to addition to the microtiter plate wells. For a control, cells are also added to several wells that lack antibiotic, but contain the solvent used to dissolve the antibiotics. Cell growth is monitored continuously by incubation at 37° C. in a microtiter plate reader monitoring the OD₆₀₀ of the wells over an 18-hour period. The percent inhibition of growth produced by each concentration of antibiotic is calculated by comparing the rates of logarithmic growth against that exhibited by cells growing in media without antibiotic. A plot of percent inhibition against log[antibiotic concentration] allows extrapolation of an IC₅₀ value for each antibiotic.

[0659] E. Measurement of Test Antibiotic Sensitivity in the Presence of Antisense Construct Inducer

[0660] The culture media selected for use in the assay is supplemented with inducer at concentrations shown to inhibit cell growth by 50% and 80% as described above, as well as the antibiotic used to maintain the construct. Two fold dilution series of the panel of test antibiotics used above are generated in each of these media. Several antibiotics are tested side by side in each medium with three to four wells being used to evaluate the effects of an antibiotic on cell growth at each concentration. Equal volumes of test antibiotic and cells are added to the wells of a 384 well microtiter plate and mixed. Cells are prepared as described above using the media selected for use in the assay supplemented with the antibiotic required to maintain the antisense construct. The cells are diluted 1:100 into two 50 mL aliquots of identical media containing concentrations of inducer that have been shown to inhibit cell growth by 50% and 80% respectively and incubated at 37° C. with shaking for 2.5 hours. Immediately prior to addition to the microtiter plate wells, the cultures are adjusted to an appropriate OD₆₀₀ (typically 0.002) by dilution into warm (37° C.) sterile media supplemented with identical concentrations of the inducer and antibiotic used to maintain the antisense construct. For a control, cells are also added to several wells that contain solvent used to dissolve test antibiotics but which contain no antibiotic. Cell growth is monitored continuously by incubation at 37° C. in a microtiter plate reader monitoring the OD₆₀₀ of the wells over an 18-hour period. The percent inhibition of growth produced by each concentration of antibiotic is calculated by comparing the rates of logarithmic growth against that exhibited by cells growing in media without antibiotic. A plot of percent inhibition against log[antibiotic concentration] allows extrapolation of an IC₅₀ value for each antibiotic.

[0661] F. Determining the Specificity of the Test Antibiotics

[0662] A comparison of the IC₅₀s generated by antibiotics of known mechanism of action under antisense induced and non-induced conditions allows the pathway in which a proliferation-required nucleic acid lies to be identified. If cells expressing an antisense nucleic acid complementary to a proliferation-required gene are selectively sensitive to an antibiotic acting via a particular pathway, then the gene against which the antisense acts is involved in the pathway on which the antibiotic acts.

[0663] G. Identification of Pathway in Which a Test Antibiotic Acts

[0664] As discussed above, the cell-based assay may also be used to determine the pathway against which a test antibiotic acts. In such an analysis, the pathways against which each member of a panel of antisense nucleic acids acts are identified as described above. A panel of cells, each containing an inducible vector which transcribes an antisense nucleic acid complementary to a gene in a known proliferation-required pathway from one of the fusion promoters described herein, is contacted with a test antibiotic for which it is desired to determine the pathway on which it acts under inducing and non-inducing conditions. If heightened sensitivity is observed in induced cells expressing antisense complementary to a gene in a particular pathway but not in induced cells expressing antisense complementary to genes in other pathways, then the test antibiotic acts against the pathway for which heightened sensitivity was observed.

[0665] One skilled in the art will appreciate that further optimization of the assay conditions, such as the concentration of inducer used to induce antisense expression and/or the growth conditions used for the assay (for example incubation temperature and media components) may further increase the selectivity and/or magnitude of the antibiotic sensitization exhibited.

[0666] The following example confirms the effectiveness of the methods described above.

EXAMPLE 22 Identification of the Biological Pathway in Which a Proliferation-Required Gene Lies

[0667] The effectiveness of the above assays was validated using proliferation-required genes from E. coli which were identified using procedures similar to those described above. Antibiotics of various chemical classes and modes of action were purchased from Sigma Chemicals (St. Louis, Mo.). Stock solutions were prepared by dissolving each antibiotic in an appropriate aqueous solution based on information provided by the manufacturer. The final working solution of each antibiotic contained no more than 0.2% (w/v) of any organic solvent. To determine their potency against a bacterial strain engineered for expression of an antisense complementary to a proliferation-required 50S ribosomal protein, each antibiotic was serially diluted two or three fold in growth medium supplemented with the appropriate antibiotic for maintenance of the anti-sense construct. At least ten dilutions were prepared for each antibiotic. 25 μL aliquots of each dilution were transferred to discrete wells of a 384-well microplate (the assay plate) using a multi-channel pipette. Quadruplicate wells were used for each dilution of an antibiotic under each treatment condition (plus and minus inducer). Each assay plate contained twenty wells for cell growth controls (growth media replacing antibiotic), ten wells for each treatment (plus and minus inducer, in this example IPTG). Assay plates were usually divided into the two treatments: half the plate containing induced cells and an appropriate concentrations of inducer (in this example IPTG) to maintain the state of induction, the other half containing non-induced cells in the absence of IPTG.

[0668] Cells for the assay were prepared as follows. Bacterial cells containing a construct, from which expression of antisense nucleic acid complementary to rplL and rplJ, which encode proliferation-required 50S ribosomal subunit proteins, is inducible in the presence of IPTG, were grown into exponential growth (OD₆₀₀ 0.2 to 0.3) and then diluted 1:100 into fresh media containing either 400 μM or 0 μM inducer (IPTG). These cultures were incubated at 37° C. for 2.5 hr. After a 2.5 hr incubation, induced and non-induced cells were respectively diluted into an assay medium at a final OD₆₀₀ value of 0.0004. The medium contained an appropriate concentration of the antibiotic for the maintenance of the antisense construct. In addition, the medium used to dilute induced cells was supplemented with 800 μM IPTG so that addition to the assay plate would result in a final IPTG concentration of 400 μM. Induced and non-induced cell suspensions were dispensed (25 μl/well) into the appropriate wells of the assay plate as discussed previously. The plate was then loaded into a plate reader, incubated at constant temperature, and cell growth was monitored in each well by the measurement of light scattering at 595 nm. Growth was monitored every 5 minutes until the cell culture attained a stationary growth phase. For each concentration of antibiotic, a percentage inhibition of growth was calculated at the time point corresponding to mid-exponential growth for the associated control wells (no antibiotic, plus or minus IPTG). For each antibiotic and condition (plus or minus IPTG), a plot of percent inhibition versus log of antibiotic concentration was generated and the IC₅₀ determined. A comparison of the IC₅₀ for each antibiotic in the presence and absence of IPTG revealed whether induction of the antisense construct sensitized the cell to the mechanism of action exhibited by the antibiotic. Cells which exhibited a significant (standard statistical analysis) numerical decrease in the IC₅₀ value in the presence of inducer were considered to have an increased sensitivity to the test antibiotic.

[0669] The above results demonstrate that induction of an antisense RNA to genes encoding 50S ribosomal subunit proteins results in a selective and highly significant sensitization of cells to antibiotics that inhibit ribosomal function and protein synthesis. The above results further demonstrate that induction of an antisense construct to an essential gene sensitizes a microorganism to compounds that interfere with that gene product's biological role. This sensitization is restricted to compounds that interfere with pathways associated with the targeted gene and its product. It will be appreciated that the cell-based assays described above may be implemented in Staphylococcus aureus and Enterococcus faecalis using proliferation-inhibiting antisense sequences under the control of a fusion promoter described herein.

[0670] The example below describes an analysis performed in Staphylococcus aureus.

EXAMPLE 23 Identification of the Biological Pathway in Which a Gene Required for Proliferation of Staphylococcus aureus Lies

[0671] Sensitized cells were also used to identify the pathway in which a gene required for proliferation of Staphylococcus aureus lies. Antibiotics of various chemical classes and modes of action were purchased from chemical suppliers, for example Sigma Chemicals (St. Louis, Mo.). Stock solutions were prepared by dissolving each antibiotic in an appropriate aqueous solution based on information provided by the manufacturer. The final working solution of each antibiotic contained no more than 0.2% (w/v) of any organic solvent.

[0672] To determine its potency against a bacterial strain containing an antisense nucleic acid complementary to the sequence encoding the Beta (>) subunit of gyrase (which is required for proliferation) under the control of the Xyl-T5 fusion promoter, each antibiotic was serially diluted two or three fold in growth medium supplemented with the appropriate antibiotic for maintenance of the anti-sense construct. At least ten dilutions were prepared for each antibiotic.

[0673] Aliquots (25 μL) of each dilution were transferred to discrete wells of a 384-well microplate (the assay plate) using a multi-channel pipette. Quadruplicate wells were used for each dilution of an antibiotic under each treatment condition (plus and minus inducer). Each assay plate retained twenty wells for cell growth controls (growth media replacing antibiotic), ten wells for each treatment (plus and minus inducer, in this example xylose). Half the assay plate contained induced cells (in this example Staphylococcus aureus cells) and appropriate concentrations of inducer (in this example xylose) to maintain the state of induction while the other half of the assay plate contained non-induced cells in the absence of inducer.

[0674] Cells of a bacterial clone containing a construct in which expression of antisense complementary to the sequence encoding the β subunit of gyrase under the control of a xylose inducible promoter were grown into exponential growth (OD₆₀₀ 0.2 to 0.3) and then diluted 1:100 into fresh media containing either 12 mM or 0 mM inducer (xylose). These cultures were incubated at 37° C. for 2.5 hr. The presence of inducer (xylose) in the medium initiates and maintains production of antisense RNA from the antisense construct, in this case. After a 2.5 hr incubation, induced and non-induced cells were respectively diluted into an assay medium containing an appropriate concentration of the antibiotic for the maintenance of the anti-sense construct. In addition, medium used to dilute induced cells was supplemented with 24 mM xylose so that addition to the assay plate would result in a final xylose concentration of 12 mM. The cells were diluted to a final OD₆₀₀ value of 0.0004.

[0675] Induced and non-induced cell suspensions were dispensed (25 :l/well) into the appropriate wells of the assay plate as discussed previously. The plate was then loaded into a plate reader and incubated at constant temperature while cell growth was monitored in each well by the measurement of light scattering at 595 nm. Growth was monitored every 5 minutes until the cell culture attained a stationary growth phase. For each concentration of antibiotic, a percentage inhibition of growth was calculated at the time point corresponding to mid-exponential growth for the associated control wells (no antibiotic, plus or minus xylose). For each antibiotic and condition (plus or minus xylose), plots of percent inhibition versus Log of antibiotic concentration were generated and IC_(50s) determined.

[0676] A comparison of each antibiotic's IC₅₀ in the presence and absence of inducer (in this example xylose) reveals whether induction of the antisense construct sensitized the cell to the antibiotic's mechanism of action. If the antibiotic acts against the P subunit of gyrase, the IC₅₀ of induced cells will be significantly lower than the IC₅₀ of uninduced cells.

[0677]FIG. 15 lists the antibiotics tested, their targets, and their fold increase in potency between induced cells and uninduced cells. As illustrated in FIG. 15, the potency of cefotaxime, cefoxitin, fusidic acid, lincomycin, tobramycin, trimethoprim and vancomycin, each of which act on targets other than the β subunit of gyrase, was not significantly different in induced cells as compared to uninduced cells. However, the potency of novobiocin, which acts against the β subunit of gyrase, was significantly different between induced cells and uninduced cells.

[0678] Thus, induction of an antisense nucleic acid complementary to the sequence encoding the β subunit of gyrase results in a selective and significant sensitization of Staphylococcus aureus cells to an antibiotic which inhibits the activity of this protein. Furthermore, the results demonstrate that induction of an antisense construct to an essential gene sensitizes a microorganism to compounds that interfere with that gene product's biological role. This sensitization is apparently restricted to compounds that interfere with the targeted gene and its product. It will be appreciated that the cell-based assays described above may be implemented in Enterococcus faecalis using proliferation-inhibiting antisense sequences under the control of a fusion promoter described herein. It will also be appreciated that the above cell-based assays employing the fusion promoters described herein can be implemented in other Gram-positive organims, including but not limited to, Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species.

[0679] Assays utilizing antisense constructs to essential genes can be used to identify compounds that interfere with the activity of those gene products. Such assays could be used to identify drug leads, for example antibiotics.

[0680] Assays utilizing antisense constructs to essential genes can be used to identify compounds that specifically interfere with the activity of multiple targets in a pathway. Such constructs can be used to simultaneously screen a sample against multiple targets in one pathway in one reaction (Combinatorial HTS).

[0681] Furthermore, as discussed above, panels of antisense construct-containing cells may be used to characterize the point of intervention of any compound affecting an essential biological pathway including antibiotics with no known mechanism of action.

[0682] The following example provides the a method for determining the pathway on which an antibiotic acts.

EXAMPLE 24 Cell-Based Assays Using Promoter Replacement or Operator Insertion

[0683] A gene encoding a gene product required for cellular proliferation is placed under the control of an inducible promoter, such as one of the inducible fusion promoters described herein, using the promoter replacement or operator insertion methods described above. Sensitized cells are generated by growing cells in which the gene encoding the gene product required for proliferation is under control of the inducible fusion promoter in medium containing varying concentrations of inducer which are less than the inducer concentration in which the promoter has maximal activity. The sensitized cells are contacted with candidate compounds to be tested for the ability to inhibit proliferation. In some embodiments, the candidate compounds may be compounds in a library of compounds generated using combinatorial chemistry techniques or compounds in a natural products library. The extent of proliferation of the sensitized cells is compared to the extent of proliferation of control cells in which the activity or level of the gene product is higher than in the sensitized cells. For example, the control cells may be cells in which the inducible fusion promoter is fully active. A compound which inhibits the growth of the sensitized cells to a substantially greater extent than the control cells is identified. This compound may be used as a drug or further optimized to generate a more potent compound.

[0684] It will be appreciated that the cell-based assays described above may be implemented in Staphylococcus aureus and Enterococcus faecalis using the fusion promoters or operators described herein. It will also be appreciated that the above cell-based assays employing the fusion promoters or operators described herein can be implemented in other Gram-positive organims, including but not limited to, Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species.

[0685] The example below describes an approach to determine if an antisense nucleic acid known to inhibit cell proliferation of one organism can successfully inhibit cell proliferation of a second organism.

EXAMPLE 25 Identification of Proliferation-Required Genes in Using Antisense Nucleic Acids from Heterologous Organisms

[0686] Once an antisense nucleic acid which inhibits the proliferation of a Gram-positive organism has been identified using the fusion promoters described herein, the antisense nucleic acid may be used to identify proliferation-required genes in heterologous organisms. In such methods, the antisense nucleic acid is transcribed in the heterologous organism and its ability to inhibit proliferation of the heterologous organism is determined.

[0687] In such embodiments, the antisense nucleic acids are inserted into expression vectors functional in the organisms in which the antisense nucleic acids are evaluated. Those skilled in the art will appreciate that a negative result in a heterologous microorganism does not mean that that microorganism is missing that gene nor does it mean that the gene is unessential. However, a positive result means that the heterologous microorganism contains a homologous gene which is required for proliferation of that microorganism. The homologous gene may be obtained using the methods described herein. Those cells that are inhibited by antisense may be used in cell-based assays as described herein for the identification and characterization of compounds in order to develop antibiotics effective in these microorganisms. Those skilled in the art will appreciate that an antisense molecule that works in the microorganism from which it was obtained will not always work in a heterologous microorganism.

[0688] The antisense nucleic acids that inhibit the growth of Staphylococcus aureus or Enterococcus faecalis, for example, may also be evaluated for their ability to inhibit the growth of microorganisms other than Staphylococcus aureus or Enterococcus faecalis. In particular, the ability of the antisense nucleic acid to inhibit the growth of Aspergillus fumigatus, Bacillus anthracis, Burkholderia cepacia, Campylobacter jejuni, Candida albicans, Chlamydia pneumoniae, Chlamydia trachomatus, Clostridium botulinum, Cryptococcus neoformans, Enterobacter cloacae, Enterococcusfaecalis, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Klebsiella pneumoniae, Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria gonorrhoeae, Nocardia asteroides, Pseudomonas aeruginosa, Salmonella cholerasuis, Salmonella paratyphi, Salmonella typhi, Salmonella typhimurium, Staphylococcus aureus, Klebsiella pneumoniae, Moxarella catarrhalis, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnet, Pseudomonas aeruginosa, Staphylococcus epidennidis, Streptococcus pneumoniae, Treponema pallidum, Yersinia pestis, or any species falling within the genera of any of the above species may be evaluated.

[0689] In such methods, expression vectors in which the expression of an antisense nucleic acid that inhibits the growth of Staphylococcus aureus or Enterococcus faecalis is under the control of an inducible promoter, such as a fusion promoter described herein, are introduced into the microorganisms in which they are to be evaluated. In some embodiments, the antisense nucleic acids may be evaluated in microorganisms which are closely related to Staphylococcus aureus or Enterococcus faecalis. The ability of these antisense nucleic acids to inhibit the growth of the related microorganisms in the presence of the inducer is then measured.

[0690] In one example, thirty-nine antisense nucleic acids which inhibited the growth of Staphylococcus aureus were identified using methods such as those described herein and were inserted into an expression vector such that their expression was under the control of a xylose-inducible Xyl-T5 promoter. A vector with a reporter gene under control of the Xyl-T5 promoter was used to show that expression from the Xyl-T5 promoter in Staphylococcus epidermidis was comparable to that in Staphylococcus aureus.

[0691] The vectors were introduced into Staphylococcus epidermidis by electroporation as follows: Staphylococcus epidermidis was grown in liquid culture to mid-log phase and then harvested by centrifugation. The cell pellet was resuspended in {fraction (1/3)} culture volume of ice-cold EP buffer (0.625 M sucrose, 1 mM MgCl₂, pH=4.0), and then harvested again by centrifugation. The cell pellet was then resuspended with {fraction (1/40)} volume EP buffer and allowed to incubate on ice for 1 hour. The cells were then frozen for storage at −80° C. For electroporation, 50 μl of thawed electrocompetent cells were combined with 0.5 μg plasmid DNA and then subjected to an electrical pulse of 10 kV/cm, 25 uFarads, 200 ohm using a BioRad gene pulser electroporation device. The cells were immediately resuspended with 200 μl outgrowth medium and incubated for 2 hours prior to plating on solid growth medium with drug selection to maintain the plasmid vector. Colonies resulting from overnight growth of these platings were selected, cultured in liquid medium with drug selection, and then subjected to dilution plating analysis as described above to test growth sensitivity in the presence of the inducer xylose.

[0692] The results are shown in Table I below. The first column indicates the Molecule Number of the Staphylococcus aureus antisense nucleic acid which was introduced into Staphylococcus epidermidis. The second column indicates whether the antisense nucleic acid inhibited the growth of Staphylococcus epidermidis, with a “+” indicating that growth was inhibited. Of the 39 Staphylococcus aureus antisense nucleic acids evaluated, 20 inhibited the growth of Staphylococcus epidermidis. TABLE I Sensitivity of Other Microorganisms to Antisense Nucleic Acids That Inhibit Proliferation of Staphylococcus aureus Mol. No. S. epidermidis SaXA005 + SaXA007 + SaXA008 + SaXA009 + SaXA010 + SaXA011 − SaXA012 − SaXA013 − SaXA015 + SaXA017 − SaXA022 + SaXA023 − SaXA024 − SaXA025 + SaXA026 + SaXA027 − SaXA027b − SaXA02c − SaXA028 − SaXA029 + SaXA030 + SaXA032 + SaXA033 + SaXA034 − SaXA035 + SaXA037 + SaXA039 − SaXA042 − SaXA043 − SaXA044 − SaXA045 + SaXA051 + SaXA053 − SaXA056b − SaXA059a + SaXA060 − SaXA061 + SaXA062 + SaXA063 − SaXA065 −

[0693] It will be appreciated above methods for identifying heterologous proliferation-required genes may be implemented in Enterococcus faecalis using the fusion promoters or operators described herein. It will be appreciated that the above methods for identifying heterologous proliferation-required genes using the fusion promoters described herein can be implemented in other Gram-positive organims, including but not limited to, Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans or any species falling within the genera of any of the above species.

[0694] Those of skill in the art will appreciate that the methods disclosed herein provide a relatively easy, rapid, and economical way of identifying essential genes in an organism and identifying compounds that alter the expression levels of the identified essential genes.

[0695] One skilled in the art will appreciate that these methods and devices are and may be adapted to carry out the objects and obtain the ends and advantages mentioned, as well as those inherent therein. The methods, procedures, and devices described herein are presently representative of preferred embodiments and are exemplary and are not intended as limitations on the scope of the invention. Changes therein and other uses will occur to those skilled in the art which are encompassed within the spirit of the invention and are defined by the scope of the disclosure.

[0696] It will be apparent to one skilled in the art that varying substitutions and modifications may be made to the invention disclosed herein without departing from the scope and spirit of the invention.

[0697] Those skilled in the art recognize that the aspects and embodiments of the invention set forth herein may be practiced separate from each other or in conjunction with each other. Therefore, combinations of separate embodiments are within the scope of the invention as disclosed herein.

[0698] All patents and publications mentioned in the specification are indicative of the levels of those skilled in the art to which the invention pertains. All patents and publications are herein incorporated by reference to the same extent as if each individual publication was specifically and individually indicated to be incorporated by reference.

[0699] The invention illustratively described herein suitably may be practiced in the absence of any element or elements, limitation or limitations which is not specifically disclosed herein. Thus, for example, in each instance herein any of the terms “comprising”, “consisting essentially of” and “consisting of” may be replaced with either of the other two terms. The terms and expressions which have been employed are used as terms of description and not of limitation, and there is no intention that in the use of such terms and expressions indicates the exclusion of equivalents of the features shown and described or portions thereof. It is recognized that various modifications are possible within the scope of the invention disclosed. Thus, it should be understood that although the present invention has been specifically disclosed by preferred embodiments and optional features, modification and variation of the concepts herein disclosed may be resorted to by those skilled in the art, and that such modifications and variations are considered to be within the scope of this invention as defined by the disclosure.

[0700] In addition, where features or aspects of the invention are described in terms of Markush groups, those skilled in the art will recognize that the invention is also thereby described in terms of any individual member or subgroup of members of the Markush group.

1 68 1 556 DNA Artificial Sequence Sequence map showing the Xyl-T5 fusion promoter and associated DNA sequences 1 gaccgtcata aaaaatttat ttgctttcag gaaaattttt ctgtataata gattcaagtt 60 agtttgttta ttaaattaac caactaaaat gtagaattcc gagctcgagc cggtacccgg 120 ggatcctcta gagtcgacct gcagccaagc ttgggctttt cagcctgata cagattaaat 180 cagaacgcag aagcggtctg ataaaacaga atttgcctgg cggcagtagc gcggtggtcc 240 cacctgaccc catgccgaac tcagaagtga aacgccgtag cgccgatggt agtgtggggt 300 ctccccatgc gagagtaggg aactgccagg catcaaataa aacgaaaggc tcagtcgaaa 360 gactgggcct ttcgttttat ctgttgtttg tcggtgaacg ctctcctgag taggacaaat 420 ccgccgggag cggatttgaa cgttgcgaag caacggcccg gagggtggcg ggcaggacgc 480 ccgccataaa ctgccaggca tcaaattaag cagaaggcca tcctgacgga tggccttttt 540 gcgtttctac aaactc 556 2 168 DNA Artificial Sequence CP25 fusion promoter 2 attaaatata aataacattc aaatgaatat taaataatta cagcagtctg agttataaaa 60 tagatatctc ggaccgacat taaattcttg acagggagag ataggtttga tagaatataa 120 tagttgtctc gagagttagt ttgtttatta aattaaccaa ctggacga 168 3 20 DNA Artificial Sequence PCR primer 3 tggcattcta cgactataac 20 4 29 DNA Artificial Sequence PCR primer 4 gcgaattcta cattttagtt ggttaattt 29 5 94 DNA Escherichia coli Oligonucleotide Xyl-T5 5 gaccgtcata aaaaatttat ttgctttcag gaaaattttt ctgtataata gattcaagtt 60 agtttgttta ttaaattaac caactaaaat gtag 94 6 95 DNA Artificial Sequence Oligonucleotide Xyl-T5 complement 6 aattctacat tttagttggt taatttaata aacaaactaa cttgaatcta ttatacagaa 60 aaattttcct gaaagcaaat aaatttttta tgacg 95 7 10929 DNA Artificial Sequence Vector pEP25 7 aagcttgcct gcaggtcgac gataaccaca taacagtcat aaaaccactc ctttttaaca 60 aactttatca caagaaatat tttggcattc tacgactata acttaaattt atatttttta 120 ctttataata tataattgat tatagaataa tgttgctcat atcgtttgcc aacatctagt 180 actcaaatta cactatgtta cacttggtaa tattaaccga acttcccctg tccaaattag 240 ataagaggta ataataaatg gaaaataatt ttatagtaaa tgaaaatgag aagcgtgtat 300 taaaacaaat tttcaataac agcaatattt cacgaacaca aatatcgaag aatttagaac 360 ttaataaagc tactatttct aacattctga acaacttaaa acacaagagt ttagttaatg 420 aagtaggaga aggtaatagt actaaaagtg gtggacgaaa gcctatttta ctcgaaatta 480 accaaaaata tggctactat atttctatgg atttaacata tgattccgtt gaattaatgt 540 acaactactt tgatgctact atattaaagc aagattccta cgaattaaat gataaaaatg 600 taagcagtat attacaaatt ttaaaatcta atataaacgt ctcagaaaaa tatgatacgt 660 tatatgggtt acttggtata tctatatcca tacacggtat cgttgacgat gagcaaaaca 720 taatcaatct tccttttcat aaaaatgaga aacgcacatt taccgatgaa ttaaagtcat 780 tcacaaatgt tcctgtcgtt atagaaaatg aagcaaattt atcagcgcta tatgaaaaaa 840 gtttatatat taattcaaac ataaataatt tgattacttt aagtattcac aagggtatag 900 gcgctggcat cataataaat aaaaaacttt atcgtggctc aaatggagag gctggagaga 960 taggtaagac attggttttg gaatctataa ataacaatga caacaaatat tataaaatcg 1020 aagatatatg ctcccaagac gctttaatac agaaaataaa taataggttg ggcgtcacat 1080 tgacgtttac agaactaatc caatattaca acgaaggaaa ttcaattgtt gctcatgaaa 1140 ttaaacaatt tattaataaa atgacagttc tgattcataa tttgaataca caatttaacc 1200 cagacgctat ttatattaac tgtcctttaa ttaatgaatt accaaatatt ttaaatgaaa 1260 ttaaagagca attctcctgt ttttctcaag gcagtccaat tcaattacat ttaactacta 1320 atgtaaaaca agctacttta ttgggtggca ctttagcaat aatgcaaaaa acattaaata 1380 taaataacat tcaaatgaat attaaataat tacagcagtc tgagttataa aatagatatc 1440 tcggaccgct ttggcagttt attcttgaca tgtagtgagg gggctggtat aatcacatag 1500 ctcgagagtt agtttgttta ttaaattaac caactggatc cggaggaaca gctatgacca 1560 tgattacgga ttcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 1620 cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 1680 cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg cgctttgcct 1740 ggtttccggc accagaagcg gtgccggaaa gctggctgga gtgcgatctt cctgaggccg 1800 atactgtcgt cgtcccctca aactggcaga tgcacggtta cgatgcgccc atctacacca 1860 acgtgaccta tcccattacg gtcaatccgc cgtttgttcc cacggagaat ccgacgggtt 1920 gttactcgct cacatttaat gttgatgaaa gctggctaca ggaaggccag acgcgaatta 1980 tttttgatgg cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg gtcggttacg 2040 gccaggacag tcgtttgccg tctgaatttg acctgagcgc atttttacgc gccggagaaa 2100 accgcctcgc ggtgatggtg ctgcgctgga gtgacggcag ttatctggaa gatcaggata 2160 tgtggcggat gagcggcatt ttccgtgacg tctcgttgct gcataaaccg actacacaaa 2220 tcagcgattt ccatgttgcc actcgcttta atgatgattt cagccgcgct gtactggagg 2280 ctgaagttca gatgtgcggc gagttgcgtg actacctacg ggtaacagtt tctttatggc 2340 agggtgaaac gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt atcgatgagc 2400 gtggtggtta tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg aaactgtgga 2460 gcgccgaaat cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc gacggcacgc 2520 tgattgaagc agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa aatggtctgc 2580 tgctgctgaa cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag catcatcctc 2640 tgcatggtca ggtcatggat gagcagacga tggtgcagga tatcctgctg atgaagcaga 2700 acaactttaa cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg tacacgctgt 2760 gcgaccgcta cggcctgtat gtggtggatg aagccaatat tgaaacccac ggcatggtgc 2820 caatgaatcg tctgaccgat gatccgcgct ggctaccggc gatgagcgaa cgcgtaacgc 2880 gaatggtgca gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg gggaatgaat 2940 caggccacgg cgctaatcac gacgcgctgt atcgctggat caaatctgtc gatccttccc 3000 gcccggtgca gtatgaaggc ggcggagccg acaccacggc caccgatatt atttgcccga 3060 tgtacgcgcg cgtggatgaa gaccagccct tcccggctgt gccgaaatgg tccatcaaaa 3120 aatggctttc gctacctgga gagacgcgcc cgctgatcct ttgcgaatac gcccacgcga 3180 tgggtaacag tcttggcggt ttcgctaaat actggcaggc gtttcgtcag tatccccgtt 3240 tacagggcgg cttcgtctgg gactgggtgg atcagtcgct gattaaatat gatgaaaacg 3300 gcaacccgtg gtcggcttac ggcggtgatt ttggcgatac gccgaacgat cgccagttct 3360 gtatgaacgg tctggtcttt gccgaccgca cgccgcatcc agcgctgacg gaagcaaaac 3420 accagcagca gtttttccag ttccgtttat ccgggcaaac catcgaagtg accagcgaat 3480 acctgttccg tcatagcgat aacgagctcc tgcactggat ggtggcgctg gatggtaagc 3540 cgctggcaag cggtgaagtg cctctggatg tcgctccaca aggtaaacag ttgattgaac 3600 tgcctgaact accgcagccg gagagcgccg ggcaactctg gctcacagta cgcgtagtgc 3660 aaccgaacgc gaccgcatgg tcagaagccg ggcacatcag cgcctggcag cagtggcgtc 3720 tggcggaaaa cctcagtgtg acgctccccg ccgcgtccca cgccatcccg catctgacca 3780 ccagcgaaat ggatttttgc atcgagctgg gtaataagcg ttggcaattt aaccgccagt 3840 caggctttct ttcacagatg tggattggcg ataaaaaaca actgctgacg ccgctgcgcg 3900 atcagttcac ccgtgcaccg ctggataacg acattggcgt aagtgaagcg acccgcattg 3960 accctaacgc ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc gaagcagcgt 4020 tgttgcagtg cacggcagat acacttgctg atgcggtgct gattacgacc gctcacgcgt 4080 ggcagcatca ggggaaaacc ttatttatca gccggaaaac ctaccggatt gatggtagtg 4140 gtcaaatggc gattaccgtt gatgttgaag tggcgagcga tacaccgcat ccggcgcgga 4200 ttggcctgaa ctgccagctg gcgcaggtag cagagcgggt aaactggctc ggattagggc 4260 cgcaagaaaa ctatcccgac cgccttactg ccgcctgttt tgaccgctgg gatctgccat 4320 tgtcagacat gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc tgcgggacgc 4380 gcgaattgaa ttatggccca caccagtggc gcggcgactt ccagttcaac atcagccgct 4440 acagtcaaca gcaactgatg gaaaccagcc atcgccatct gctgcacgcg gaagaaggca 4500 catggctgaa tatcgacggt ttccatatgg ggattggtgg cgacgactcc tggagcccgt 4560 cagtatcggc ggaattccag ctgagcgccg gtcgctacca ttaccagttg gtctggtgtc 4620 aaaaataata atgtcgaggg taccgagctc gaattccttt tagtccagcc gagcgcttag 4680 tgggaatttg taccccttat cgatacaaat tccccgtagg cgctagggac ctctttagct 4740 ccttggaagc tgtcagtagt atacctaata atttatctac attcccttaa gtaacgtgta 4800 actttccaaa tttacaaaag cgactcatag aattatttcc tcccgttaaa taatagataa 4860 ctattaaaaa tagacaatac ttgctcataa gtaacggtac ttaaattgtt tactttggcg 4920 tgtttcattg cttgatgaaa ctgattttta gtaaacagtt gacgatattc tcgattgacc 4980 cattttgaaa caaagtacgt atatagcttc caatatttat ctggaacatc tgtggtatgg 5040 cgggtaagtt ttattaagac actgtttact tttggtttag gatgaaagca ttccgctggc 5100 agcttaagca attgctgaat cgagacttga gtgtgcaaga gcaaccctag tgttcggtga 5160 atatccaagg tacgcttgta gaatccttct tcaacaatca gatagatgtc agacgcatgg 5220 ctttcaaaaa ccactttttt aataatttgt gtgcttaaat ggtaaggaat actcccaaca 5280 attttatacc tctgtttgtt agggaattga aactgtagaa tatcttggtg aattaaagtg 5340 acacgagtat tcagttttaa tttttctgac gataagttga atagatgact gtctaattca 5400 atagacgtta cctgtttact tattttagcc agtttcgtcg ttaaatgccc tttacctgtt 5460 ccaatttcgt aaacggtatc ggtttctttt aaattcaatt gttttattat ttggttgagt 5520 actttttcac tcgttaaaaa gttttgagaa tattttatat ttttgttcat gtaatcactc 5580 cttcttaatt acaaattttt agcatctaat ttaacttcaa ttcctattat acaaaatttt 5640 aagatactgc actatcaaca cactcttaag tttgcttcta agtcttattt ccataacttc 5700 ttttacgttt ccgccattct ttgctgtttc gatttttatg atatggtgca agtcagcacg 5760 aacacgaacc gtcttatctc ccattatatc tttttttgca ctgattggtg tatcatttcg 5820 tttttctttt tgtgcgccta aatttcccac aatcactcac ttctttctat ttcttcttat 5880 tcttatttta tcatcaacaa tcacaaatca cttgtgattt gtgataagtg atttgtgatt 5940 aatatataaa agccctcttt aaagggcttt tatgtttatt ttgagaaaga tataaaatca 6000 atatatccct tttctccgat ttttacaacg gcattgtagg actttctatc tttcgttttg 6060 attcctttta ccagggtttc ttttccctct agtaattctt ttacatttgt tttggtgagt 6120 tttttctttc taaaatgttc agctaaagta aacttacatt caggataatt tgaacaacca 6180 taaaacgatt tttttaatac aatattgttg ccacacttag gacattttcc tacaatactt 6240 ttttctgctt ctttttcttt ctgttcctgg taatcagaaa aatttagttt ttctatatcg 6300 ttaggtacag cttccagtaa atgaacaatg aattttttga tattcgtaat aaagttctct 6360 tgattgcctt ctcttttacc gatttttttt aaatacgttt cccatttagc cgtcatttca 6420 gcactcgtta aaaggtgctg actttcaact gcctggcaca ataattttcc tttttcagtt 6480 acaacaagct ctgatttcac tttttgcatt ctacaaactg cataactcat atgtaaatcg 6540 ctccttttta ggtggcacaa atgtgaggca ttttcgctct ttccggcgag gctagttacc 6600 cttaagttat tggtatgact ggttttaagc gcaaaaaaag ttgctttttc gtacctatta 6660 atgtatcgtt agaaaaccga ctgtaaaaag tacagtcggc attatctcat attataaaag 6720 ccagtcatta ggcctatctg acaattcctg aatagagttc ataaacaatc ctgcatgata 6780 accatcacaa acagaatgat gtacctgtaa agatagcggt aaatatattg aattaccttt 6840 attaatgaat tttcctgctg taataatggg tagaaggtaa ttactattat tattgatatt 6900 taagttaaac ccagtaaatg aagtccatgg aataatagaa agagaaaaag cattttcagg 6960 tataggtgtt ttgggaaaca atttccccga accattatat ttctctacat cagaaaggta 7020 taaatcataa aactctttga agtcattctt tacaggagtc caaataccag agaatgtttt 7080 agatacacca tcaaaaattg tataaagtgg ctctaactta tcccaataac ctaactctcc 7140 gtcgctattg taaccagttc taaaagctgt atttgagttt atcacccttg tcactaagaa 7200 aataaatgca gggtaaaatt tatatccttc ttgttttatg tttcggtata aaacactaat 7260 atcaatttct gtgggtatac taaaagtcgt ttggtggttc aaataatgat taaatatctc 7320 ttttctcttc caattgtcta aatcaatttt attaaagttc atttgatatg cctcctaaat 7380 ttttatctaa agtgaattta ggaggcttac ttgtctgctt tcttcattag aatcaatcct 7440 tttttaaaag tcaatattat tgtaacataa atatatattt taaaaatatc ccactttatc 7500 caattttcgt ttgttgaact aatgggtgct ttagttgaag aataaaagac cacattaaaa 7560 aatgtggtct tttgtgtttt tttaaaggat ttgagcgtag cgaaaaatcc ttttctttct 7620 tatcttgata ataagggtaa ctattgccgg gatagactgt aacattctca cgcataaaat 7680 cccctttcat tttctaatgt aaatctatta ccttattatt aattcaattc gctcataatt 7740 aatccttttt cttattacgc aaaatggccc gatttaagca caccctttat tccgttaatg 7800 cgccatgaca gccatgataa ttactaatac taggagaagt taataaatac gtaaccaaca 7860 tgattaacaa ttattagagg tcatcgttca aaatggtatg cgttttgaca catccactat 7920 atatccgtgt cgttctgtcc actcctgaat cccattccag aaattctcta gcgattccag 7980 aagtttctca gagtcggaaa gttgaccaga cattacgaac tggcacagat ggtcataacc 8040 tgaaggaaga tctgattgct taactgcttc agttaagacc gaagcgctcg tcgtataaca 8100 gatgcgatga tgcagaccaa tcaacatggc acctgccatt gctacctgta cagtcaagga 8160 tggtagaaat gttgtcggtc cttgcacacg aatattacgc catttgcctg catattcaaa 8220 cagctcttct acgataaggg cacaaatcgc atcgtggaac gtttgggctt ctaccgattt 8280 agcagtttga tacactttct ctaagtatcc acctgaatca taaatcggca aaatagagaa 8340 aaattgacca tgtgtaagcg gccaatctga ttccacctga gatgcataat ctagtagaat 8400 ctcttcgcta tcaaaattca cttccacctt ccactcaccg gttgtccatt catggctgaa 8460 ctctgcttcc tctgttgaca tgacacacat catctcaata tccgaatagg gcccatcagt 8520 ctgacgacca agagagccat aaacaccaat agccttaaca tcatccccat atttatccaa 8580 tattcgttcc ttaatttcat gaacaatctt cattctttct tctctagtca ttattattgg 8640 tccattcact attctcattc ccttttcaga taattttaga tttgcttttc taaataagaa 8700 tatttggaga gcaccgttct tattcagcta ttaataactc gtcttcctaa gcatccttca 8760 atccttttaa taacaattat agcatctaat cttcaacaaa ctggcccgtt tgttgaacta 8820 ctctttaata aaataatttt tccgttccca attccacatt gcaataatag aaaatccatc 8880 ttcatcggct ttttcgtcat catctgtatg aatcaaatcg ccttcttctg tgtcatcaag 8940 gtttaatttt ttatgtattt cttttaacaa accaccatag gagattaacc ttttacggtg 9000 taaaccttcc tccaaatcag acaaacgttt caaattcttt tcttcatcat cggtcataaa 9060 atccgtatcc tttacaggat attttgcagt ttcgtcaatt gccgattgta tatccgattt 9120 atatttattt ttcggtcgat tttttattaa aacgtctcaa aatcgtttct gagacgtttt 9180 agcgtttatt tcgtttagtt atcggcataa tcgttaaaac aggcgttatc gtagcgtaaa 9240 agcccttgag cgtagcgtgg ctttgcagcg aagatgttgt ctgttagatt atgaaagccg 9300 atgactgaat gaaataataa gcgcagcgtc cttctatttc ggttggagga ggctcaaggg 9360 agtttgaggg aatgaaattc cctcatgggt ttgattttaa aaattgcttg caattttgcc 9420 gagcggtagc gctggaaaat ttttgaaaaa aatttggaat ttggaaaaaa atggggggga 9480 aaggaagcga attttgcttc cgtactacga ccccccatta agtgccgagt gccaattttt 9540 gtgccaaaaa cgctctatcc caactggctc aagggtttga ggggtttttc aatcgccaac 9600 gaatcgccaa cgttttcgcc aacgtttttt ataaatctat atttaagtag ctttattgtt 9660 gtttttatga ttacaaagtg atacactaat tttataaaat tatttgattg gagtttttta 9720 aatggtgatt tcagaatcga aaaaaagagt tatgatttct ctgacaaaag agcaagataa 9780 aaaattaaca gatatggcga aacaaaaagg tttttcaaaa tctgcgggtt gcgggcgtta 9840 gctatagaag aatatgcaag aaaggaatca gaacaaaaaa aataagcgaa agctcgcgtt 9900 tttagaagga tacgagtttt cgctacttgt ttttgataag gtaattatat catggctatt 9960 aaaaatacta aagctagaaa ttttggattt ttattatatc ctgactcaat tcctaatgat 10020 tggaaagaaa aattagagag tttgggcgta tctatggctg tcagtccttt acacgatatg 10080 gacgaaaaaa aagataaaga tacatggaat agtagtgatg ttatacgaaa tggaaagcac 10140 tataaaaaac cacactatca cgttatatat attgcacgaa atcctgtaac aatagaaagc 10200 gttaggaaca agattaagcg aaaattgggg aatagttcag ttgctcatgt tgagatactt 10260 gattatatca aaggttcata tgaatatttg actcatgaat caaaggacgc tattgctaag 10320 aataaacata tatacgacaa aaaagatatt ttgaacatta atgattttga tattgaccgc 10380 tatataacac ttgatgaaag ccaaaaaaga gaattgaaga atttactttt agatatagtg 10440 gatgactata atttggtaaa tacaaaagat ttaatggctt ttattcgcct taggggagcg 10500 gagtttggaa ttttaaatac gaatgatgta aaagatattg tttcaacaaa ctctagcgcc 10560 tttagattat ggtttgaggg caattatcag tgtggatata gagcaagtta tgcaaaggtt 10620 cttgatgctg aaacggggga aataaaatga caaacaaaga aaaagagtta tttgctgaaa 10680 atgaggaatt aaaaaaagaa attaaggact taaaagagcg tattgaaaga tacagagaaa 10740 tggaagttga attaagtaca acaatagatt tattgagagg agggattatt gaataaataa 10800 aagcccccct gacgaaagtc gaagggggct tttattttgg tttgatgttg cgattaatag 10860 caatacgatt gcaataaaca aaatgatcga tgaaagcaat gtgagagaaa aatatgttcc 10920 taaaacaac 10929 8 12739 DNA Artificial Sequence Vector pEPEF1 8 aagcttgcct gcaggcgacg ataaccacat aacagtcata aaaccactcc tttttaacaa 60 actttatcac aagaaatatt ttggcattct acgactataa cttaaattta tattttttac 120 tttataatat ataattgatt atagaataat gttgctcata tcgtttgcca acatctagta 180 ctcaaattac actatgttac acttggtaat attaaccgaa cttcccctgt ccaaattaga 240 taagaggtaa taataaatgg aaaataattt tatagtaaat gaaaatgaga agcgtgtatt 300 aaaacaaatt ttcaataaca gcaatatttc acgaacacaa atatcgaaga atttagaact 360 taataaagct actatttcta acattctgaa caacttaaaa cacaagagtt tagttaatga 420 agtaggagaa ggtaatagta ctaaaagtgg tggacgaaag cctattttac tcgaaattaa 480 ccaaaaatat ggctactata tttctatgga tttaacatat gattccgttg aattaatgta 540 caactacttt gatgctacta tattaaagca agattcctac gaattaaatg ataaaaatgt 600 aagcagtata ttacaaattt taaaatctaa tataaacgtc tcagaaaaat atgatacgtt 660 atatgggtta cttggtatat ctatatccat acacggtatc gttgacgatg agcaaaacat 720 aatcaatctt ccttttcata aaaatgagaa acgcacattt accgatgaat taaagtcatt 780 cacaaatgtt cctgtcgtta tagaaaatga agcaaattta tcagcgctat atgaaaaaag 840 tttatatatt aattcaaaca taaataattt gattacttta agtattcaca agggtatagg 900 cgctggcatc ataataaata aaaaacttta tcgtggctca aatggagagg ctggagagat 960 aggtaagaca ttggttttgg aatctataaa taacaatgac aacaaatatt ataaaatcga 1020 agatatatgc tcccaagacg ctttaataca gaaaataaat aataggttgg gcgtcacatt 1080 gacgtttaca gaactaatcc aatattacaa cgaaggaaat tcaattgttg ctcatgaaat 1140 taaacaattt attaataaaa tgacagttct gattcataat ttgaatacac aatttaaccc 1200 agacgctatt tatattaact gtcctttaat taatgaatta ccaaatattt taaatgaaat 1260 taaagagcaa ttctcctgtt tttctcaagg cagtccaatt caattacatt taactactaa 1320 tgtaaaacaa gctactttat tgggtggcac tttagcaata atgcaaaaaa cattaaatat 1380 aaataacatt caaatgaata ttaaataatt acagcagtct gagttataaa atagatatct 1440 cggaccgctt tggcagttta ttcttgacat gtagtgaggg ggctggtata atcacatagt 1500 actgttctcg agagttagtt tgtttattaa attaaccaac tggatcccgg gtctagatta 1560 gggtaacttt gaaaggatat tcctcatgca agctaatctt caatggttag atgacccaga 1620 agtcttccgg gtcaaccaat tacctgcaca tagtgatcac cattattatc acgacacagc 1680 agaattcaaa acgggtagtc gcttcatcaa gagtctcaat ggcgcttggc gttttaactt 1740 cgccaagaca ccggctgaac gcccagttga tttttatcaa cccgatttcg atgcaaccga 1800 ctttgatacg attcaagttc ccggtcatat tgaactagcc ggctatggtc aaattcaata 1860 cattaacacg ctatacccat gggaaggtaa aatttaccgt cgcccaccgt ataccctcaa 1920 tcaagatcaa ttaacaccag gcctattcag cgacgctgcg gacaacaccg tcggctcgta 1980 cctcaaaacc ttcgatctcg acgatgtttt taaagggcaa cgtattatca ttcagttcca 2040 aggggtagaa gaagccctgt acgtctggtt aaatggccat tttattggct actctgaaga 2100 tagtttcacc ccttcagaat ttgatttgac gccgtatatt caggaccaag gtaacgtttt 2160 agcggttcgg gtctacaaac acagtactgc tgcctttatt gaagaccaag atatgttccg 2220 tttctctggt attttccgtg acgtcaatat actggcggag cctgctagcc atattactga 2280 tttggacatc cgaccagttc caaatgccaa tctcaaaagt ggtgagctca acatcactac 2340 taaagtaacc ggcgaaccag ccactttagc gctgaccgtt aaagaccatg acgggcgagt 2400 actgacgagt caaacgcaaa ccggtagtgg cagtgtaacc tttgatacta tgttattcga 2460 ccaactgcac ttgtggtcac cacaaacgcc gtatctctat caattgacaa ttgaagttta 2520 cgatgctgat caccaactct tggaagtcgt cccatatcag tttgggttcc ggacggtcga 2580 gctgcgcgat gacaaagtca tttacgtcaa caataaacgg ttggtgatca acggggttaa 2640 ccggcacgaa tggaacgccc acaccggtcg cgttatcagt atggctgata tgcgcgctga 2700 tatccaaacc atgttagcta acaatatcaa tgccgatcgg acctgccatt atcctgatca 2760 attaccttgg tatcaattat gtgacgaggc cggtatctac ctaatggccg aaaccaacct 2820 cgaatcgcac gggtcatggc aaaagatggg ggctatcgag ccttcttaca atgttcctgg 2880 cgataatcca cactggccag cagcggtgat cgaccgggcc cgttcaaact acgaatggtt 2940 taaaaaccac ccatcaatca ttttttggtc acttggcaat gaatcgtatg ctggcgaaga 3000 tatcgcggcg atgcaggctt tttataaaga acacgatgat tcacgactcg tccactacga 3060 aggcgttttc tacacaccag aattaaaaga tcgcatttct gatgttgaaa gtcggatgta 3120 cgaaaagccc caaaatattg tagcttactt ggaagataac ccaaccaaac ctttcctaaa 3180 ttgtgaatat atgcatgaca tggggaattc tctgggcggt atgcaatcat ataatgattt 3240 gatcgacaag tatccaatgt atcaaggtgg ctttatttgg gactttattg atcaagcact 3300 cttcgttcat gacccaatca ccgaccaaga cgtgctccgg tatggcggtg atttcgacga 3360 acgccactcc gattatgcat tctccggtaa cggcttaatg tttgccgacc ggacaccaaa 3420 accagcaatg caagaggtga aatattatta tggcttacac aaataatcaa ctacacgtta 3480 tttacggcga cgggagctta ggactacagg gggctaattt ccactacctc tttagctacg 3540 aacgtggggg acttgaatca ctcgtcgtca acgataaaga gtggctctat cgtacaccca 3600 cgcccatgtt ttggcgggcg acaaccgata atgatcacgg tagcggcttt tcagtcaaat 3660 ccgcacagtg gtacgcggcc gataagttct caacttgtca agatatcgaa ttgacggttg 3720 acgaccaacc agtcacaccg ttaccaatcg cgccactcaa taacaaatac acggatcacg 3780 aaatcgccac gaaagtctct ctggcttacc acttcgttac cacgaccgtt cctagtacca 3840 tcgtcacagt gacttatacg gtgacagcag acggtcagat caatatcgcc acccattata 3900 gcggtcagtc tgatttgcca gagctacccg catttggtct gcggtttatc atgccaacta 3960 ccgcgaccgg cttcgactat accggtttgt ccggtgagac ttatcctgac cggctggctg 4020 gcgcaacaca cgggcaattc cacgttgaca gtctgccagt cacaccatac ttggtcccac 4080 aagaatgcgg catgcacatg caaactgaac aagtgacagt aacgcgatca acaacacaaa 4140 ataacgctga ccacgacaac acaccattca gtttaacatt tagccaaacc gatgcaccat 4200 tcgccttcag ctgccttccc tataccgctg ctgaactaga aaacgcaaca cacatggaag 4260 aattaccatt agcacggcga acggtcttat caatctacgg tgccgttcgt ggggtcggtg 4320 gcattgacag ttggggaacg gacgtagaag ccccatatca tatcctcgct aatcaagaca 4380 ttgacttcag ctttaacatt catttctaaa aattaatttg atttcaaaag aatcgctccg 4440 gcagttattt gccagagcgt tcttttaata tgttttatgg ctgagcttta gtcctttgaa 4500 gtgggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4560 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4620 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4680 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4740 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4800 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4860 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4920 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4980 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 5040 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 5100 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 5160 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 5220 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 5280 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 5340 gggggggggg ggggtcgacc gatgcccttg agagccttca acccagtcag ctccttccgg 5400 tgggcgcggg gcatgactat cgtcgccgca cttatgactg tcttctttat catgcaactc 5460 gtaggacagg tgccggcagc gctctgggtc attttcggcg aggaccgctt tcgctggagc 5520 gcgacgatga tcggcctgtc gcttgcggta ttcggaatct tgcacgccct cgctcaagcc 5580 ttcgtcactg gtcccgccac caaacgtttc ggcgagaagc aggccattat cgccggcatg 5640 gcggccgacg cgctgggcta cgtcttgctg gcgttcgcga cgcgaggctg gatggccttc 5700 cccattatga ttcttctcgc ttccggcggc atcgggatgc ccgcgttgca ggccatgctg 5760 tccaggcagg tagatgacga ccatcaggga cagcttcaag gatcgctcgc ggctcttacc 5820 agcctaactt cgatcactgg accgctgatc gtcacggcga tttatgccgc ctcggcgagc 5880 acatggaacg ggttggcatg gattgtaggc gccgccctat accttgtctg cctccccgcg 5940 ttgcgtcgcg gtgcatggag ccgggccacc tcgacctgaa tggaagccgg cggcacctcg 6000 ctaacggatt caccactcca agaattggag ccaatcaatt cttgcggaga actgtgaatg 6060 cgcaaaccaa cccttggcag aacatatcca tcgcgtccgc catctccagc agccgcacgc 6120 ggcgcatctc gggcagcgtt gggtcctggc cacgggtgcg catgatcgtg ctcctgtcgt 6180 tgaggacccg gctaggctgg cggggttgcc ttactggtta gcagaatgaa tcaccgatac 6240 gcgagcgaac gtgaagcgac tgctgctgca aaacgtctgc gacctgagca acaacatgaa 6300 tggtcttcgg tttccgtgtt tcgtaaagtc tggaaacgcg gaagtcccct acgtgctgct 6360 gaagttgccc gcaacagaga gtggaaccaa ccggtgatac cacgatacta tgactgagag 6420 tcaacgccat gagcggcctc atttcttatt ctgagttaca acagtccgca ccgctgtccg 6480 gtagctcctt ccggtgggcg cggggcatga ctatcgtcgc cgcacttatg actgtcttct 6540 ttatcatgca actcgtagga caggtgccgg cagcgcccaa cagtcccccg gccacggggc 6600 ctgccaccat acccacgccg aaacaagcgc cctgcaccat tatgttccgg atctgcatcg 6660 caggatgctg ctggctaccc tgtggaacac ctacatctgt attaacgaag cgctaaccgt 6720 ttttatcagg ctctgggagg cagaataaat gatcatatcg tcaattatta cctccacggg 6780 gagagcctga gcaaactggc ctcaggcatt tgagaagcac acggtcacac tgcttccggt 6840 agtcaataaa ccggtaaacc agcaatagac ataagcggct atttaacgac cctgccctga 6900 accgacgacc gggtcgaatt tgctttcgaa tttctgccat tcatccgctt attatcactt 6960 attcaggcgt agcaccaggc gtttaagggc accaataact gccttaaaaa aattacgccc 7020 cgccctgcca ctcatcgcag tactgttgta attcattaag cattctgccg acatggaagc 7080 catcacagac ggcatgatga acctgaatcg ccagcggcat cagcaccttg tcgccttgcg 7140 tataatattt gcccatggtg aaaacggggg cgaagaagtt gtccatattg gccacgttta 7200 aatcaaaact ggtgaaactc acccagggat tggctgagac gaaaaacata ttctcaataa 7260 accctttagg gaaataggcc aggttttcac cgtaacacgc cacatcttgc gaatatatgt 7320 gtagaaactg ccggaaatcg tcgtggtatt cactccagag cgatgaaaac gtttcagttt 7380 gctcatggaa aacggtgtaa caagggtgaa cactatccca tatcaccagc tcaccgtctt 7440 tcattgccat acgaattcag aggtttgatg actttgacct ttggttttaa atttttgaaa 7500 aaaataaaaa aaggcgaagc ctattatata tttatcatat atattttaat cttttgttct 7560 tttgcgtgaa aaaaaaggca gtgttttcgc tagttataga aattaaacag tcacaaaaat 7620 cgatgtatac agtcacaaaa atcgattttt gtgactctat gcatcgatgt atacgagtca 7680 caaaaatcga tgtacacagt acgacttttg tatttgtgta ctgtatatag tataataaaa 7740 gcatagagaa aactcactat gaaatgactt tctctatgct actactaaaa cacgcaaagg 7800 agcgtattta tactatgatt ataccagaaa aacaaaacaa acaaaagcag gtgctgacct 7860 tgaatgaatt agaaaaacgc aaagtagtgg agcataacgc tttaattcaa agtgtggcta 7920 aaatgcaaaa aacagcttta aaaatgtttg agttagccgt gtcttgtatt gataccgaag 7980 aacccccaaa aaataatact gtttatcttt taaaatctga actttttaaa ttttttgaag 8040 tatcgtctag cagtaagcat agtcaattca aagaagcagt taattatatg caaaagcaag 8100 cctttttcaa tattaaagct gataagaaac taggtattga gtatgaaagc attgtgccta 8160 ttccttatgt aaaatggaac gattataacg atgaagtaac aatacgtttt gaccaagcca 8220 ttatgcccta tctgattgat ttaaaggcag aatttactca atataaaatt tccgaactac 8280 aaaaactcaa tagcaagtat tcgattattt tgtatcgttg gttatccatg aattataacc 8340 aatacgaaca ttacagcgtg aaagggggac ggagagttga acaagtggaa tcctaccgca 8400 acccctcaat aaaagtgaaa gaattgcgag aaataactga tacaataaat gaacatcaac 8460 attttcccca ttttgaaact agagtattaa aaaaagcaat tgaagaaatc aacgctcaca 8520 cctcttttaa tgtaacctat gagaaaaaga aagcagggcg gagcattgac tctattgtct 8580 ttcatattga gaagaaacgc atggcagacg ataacagtta caagttggga gataaagact 8640 atcaagacga taaaaagcaa aaatcaagaa atgaagctga cttactaaaa caggcaatgg 8700 aaagtaaata tacaagactt ttatctgaaa atttcttaat tggcatgaat gatattatgg 8760 atacagccac aatggtaggc ttacagaaga atgtttatcc gctttatgac gaactgaaag 8820 aattaagagg gctaaatggt gtcaaagacc acttgtctta tgttgccagc aaaaaagaag 8880 cctattctaa acgtaatgta gcgaaatatc ttaaaaaagc cattgaacaa tatctgccaa 8940 cggttaaaag gcaggactta aaccatgagt gaagacttaa aaacgataaa agagttggcg 9000 gacgagctag gactagctaa acagaccgtt caataccatt taaagttttt gccgaccgaa 9060 aatcgacaaa aaaacaatag tggttcgatc gtcctaaacc ttgaagaaca aaagtttatc 9120 aaaaaaagac taacaaaaat aaaaaaagtt caaccgaccg aaaatcgaca agaaaaaatg 9180 aagaattaag cgttaatagt cttttgttag gaattccgga tgagcattca tcaggcgggc 9240 aagaatgtga ataaaggccg gataaaactt gtgcttattt ttctttacgg tctttaaaaa 9300 ggccgtaata tccagctgaa cggtctggtt ataggtacat tgagcaactg actgaaatgc 9360 ctcaaaatgt tctttacgat gccattggga tatatcaacg gtggtatatc cagtgatttt 9420 tttctccatt ttagcttcct tagctcctga aaatctcgat aactcaaaaa atacgcccgg 9480 tagtgatctt atttcattat ggtgaaagtt ggaacctctt acgtgccgat caacgtctca 9540 ttttcgccaa aagttggccc agggcttccc ggtatcaaca gggacaccag gatttattta 9600 ttctgcgaag tgatcttccg tcacaggtat ttattcggcg caaagtgcgt cgggtgatgc 9660 tgccaactta ctgatttagt gtatgatggt gtttttgagg tgctccagtg gcttctgttt 9720 ctatcagctg tccctcctgt tcagctactg acggggtggt gcgtaacggc aaaagcaccg 9780 ccggacatca gcgctagcgg agtgtatact ggcttactat gttggcactg atgagggtgt 9840 cagtgaagtg cttcatgtgg caggagaaaa aaggctgcac cggtgcgtca gcagaatatg 9900 tgatacagga tatattccgc ttcctcgctc actgactcgc tacgctcggt cgttcgactg 9960 cggcgagcgg aaatggctta cgaacggggc ggagatttcc tggaagatgc caggaagata 10020 cttaacaggg aagtgagagg gccgcggcaa agccgttttt ccataggctc cgcccccctg 10080 acaagcatca cgaaatctga cgctcaaatc agtggtggcg aaacccgaca ggactataaa 10140 gataccaggc gtttccccct ggcggctccc tcgtgcgctc tcctgttcct gcctttcggt 10200 ttaccggtgt cattccgctg ttatggccgc gtttgtctca ttccacgcct gacactcagt 10260 tccgggtagg cagttcgctc caagctggac tgtatgcacg aaccccccgt tcagtccgac 10320 cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggaaagaca tgcaaaagca 10380 ccactggcag cagccactgg taattgattt agaggagtta gtcttgaagt catgcgccgg 10440 ttaaggctaa actgaaagga caagttttgg tgactgcgct cctccaagcc agttacctcg 10500 gttcaaagag ttggtagctc agagaacctt cgaaaaaccg ccctgcaagg cggttttttc 10560 gttttcagag caagagatta cgcgcagacc aaaacgatct caagaagatc atcttattaa 10620 tcagataaaa tatttctaga tttcagtgca atttatctct tcaaatgtag cacctgaagt 10680 cagccccata cgatataagt tgtaattctc atgtttgaca gcttatcatc gatacaaatt 10740 ccccgtaggc gctagggacc tctttagctt cttggaagct gtcagtagta tatctaataa 10800 tttatctcca ttccctttag taacgtgtaa ctttccaaat ttaaaaaagc gactcataga 10860 attatttcct cccgttaaat aatagataac tattaaaaat agacaatact tgctcataag 10920 taatggtact taaattgttt actttggcgt gtttcattgc ttgatgaaac tgatttttag 10980 taaacagttg acgatattct cgattgaccc attttgaaac aaagtacgta tatagcttcc 11040 aatatttatc tggaacatct gtggtatggc gggtaagttt tattaagaca ctgtttactt 11100 ttggtttagg atgaaagcat tccgctggca gcttaagcaa ttgctgaatc gagacttgag 11160 tgtgcaagag caaccctagt gttcggtgaa tatccaaggt acgcttgtag aatccttctt 11220 caacaatcag atagatgtca gacgcacggc tttcaaaaac cactttttta ataatttgtg 11280 tgcttaaatg gtaaggaata ttcccaacaa ttttatacct ctgtttgtta gggaattgaa 11340 actgtagaat atcttggtga attaaagtga cacgagtatt cagttttaat ttttctgacg 11400 ataagttgaa tagatgactg tctaattcaa tagacgttac ctgtttactt attttagcca 11460 gtttcgtcgt taaatgccct ttacctgttc caatttcgta aacggtatcg gtttctttta 11520 aattcaattg ttttattatt tggttgagta ctttttcact cgttaaaaag ttttgagaat 11580 attttatatt tttgttcatg taatctctcc tgaagtgatt acatctataa acaaatacag 11640 aagttaaacg atttgtttgt aattttagtt atctgtttaa aaagtcataa gattagtcac 11700 tggtaggaat taatctaacg tatttattta tctgcgtaat cactgttttt agtctgtttc 11760 aaaacagtag atgttttatc tacattacgc atttggaata ccaacatgac gaatccctcc 11820 ttcttaatta caaattttta gcatctaatt taacttcaat tcctattata caaaatttta 11880 agataatgca ctatcaacac actcttaagt ttgcttctaa gtcttatttc cataacttct 11940 tttacgtttc cgccattctt tgctgtttcg atttttatga tatggtgcaa gtcagcacga 12000 acacgaaccg tcttatctcc cattatatct ttttttgcac tgattggtgt atcatttcgt 12060 ttttcttttt gtgcgcctaa atttcccaca atcactcact tctttctatt tcttcttatt 12120 cttattttat catcaacaat cacaaatcac ttgtgatttg tgataagtga tttgtgataa 12180 gtgatttgtg attaatatat aaaagccctc tttaaagggc ttttatgttt attttgagaa 12240 agatataaaa tcaatatatc ccttttcccc aatttttaca acggcattgt agggcttttt 12300 ctctttgttt ttgatttcct tttaccaggg tttcttttcc ctccagtaat tcttttacat 12360 tcgttttggt tagttttttc tttctaaaat gttcagctaa agtaaactta cattcaggat 12420 aatttgaaca accataaaac gattttttta atacaatatt gttgccacac ttaggacatt 12480 ttcctacaat acttttttct gcttcttttt ctttctgttc ctggtaatca gaaaaattta 12540 gtttttctat atcgttaggt acagcttcca gtaaatgaac aatgaatttt ttgatattcg 12600 taataaagtt ctcttgattg ccttctcttt taccgatttt ttttaaatac gtttcccatt 12660 tagccgtcat ttcagcactc gttaaaaggt gctgactttc aactgcctgg cacaataatt 12720 ttcctttttc agttacaac 12739 9 57 DNA Artificial Sequence Oligonucleotide P59 9 gaccgacatt aaattcttga cagggagaga taggtttgat agaatataat agttgtc 57 10 58 DNA Artificial Sequence Oligonucleotide P59 complement 10 tcgagacaac tattatattc tatcaaacct atctctccct gtcaagaatt taatgtcg 58 11 17 DNA Artificial Sequence Lac operator sequence 11 ttgttatccg ctcacaa 17 12 94 DNA Artificial Sequence Oligonucleotide Xyl/Lac-T5 12 gaccgtcata aaaaatttat ttgctttgtg agcggataac aattataata gattcaagtt 60 agtttgttta ttaaattaac caactaaaat gtag 94 13 95 DNA Artificial Sequence Oligonucleotide Xyl/Lac-T5 complement 13 aattctacat tttagttggt taatttaata aacaaactaa cttgaatcta ttataattgt 60 tatccgctca caaagcaaat aaatttttta tgacg 95 14 57 DNA Artificial Sequence Oligonucleotide Xyl/Lac-P59 14 gaccgacatt aaattcttga cattgtgagc ggataacaat agaatataat agttgtc 57 15 58 DNA Artificial Sequence Oligonucleotide Xyl/Lac P59 complement 15 tcgagacaac tattatattc tattgttatc cgctcacaat gtcaagaatt taatgtcg 58 16 6852 DNA Artificial Sequence Vector pXyl-T5 p15a 16 tggcattcta cgactataac ttaaatttat attttttact ttataatata taattgatta 60 tagaataatg ttgctcatat cgtttgccaa catctagtac tcaaattaca ctatgttaca 120 cttggtaata ttaaccgaac ttcccctgtc caaattagat aagaggtaat aataaatgga 180 aaataatttt atagtaaatg aaaatgagaa gcgtgtatta aaacaaattt tcaataacag 240 caatatttca cgaacacaaa tatcgaagaa tttagaactt aataaagcta ctatttctaa 300 cattctgaac aacttaaaac acaagagttt agttaatgaa gtaggagaag gtaatagtac 360 taaaagtggt ggacgaaagc ctattttact cgaaattaac caaaaatatg gctactatat 420 ttctatggat ttaacatatg attccgttga attaatgtac aactactttg atgctactat 480 attaaagcaa gattcctacg aattaaatga taaaaatgta agcagtatat tacaaatttt 540 aaaatctaat ataaacgtct cagaaaaata tgatacgtta tatgggttac ttggtatatc 600 tatatccata cacggtatcg ttgacgatga gcaaaacata atcaatcttc cttttcataa 660 aaatgagaaa cgcacattta ccgatgaatt aaagtcattc acaaatgttc ctgtcgttat 720 agaaaatgaa gcaaatttat cagcgctata tgaaaaaagt ttatatatta attcaaacat 780 aaataatttg attactttaa gtattcacaa gggtataggc gctggcatca taataaataa 840 aaaactttat cgtggctcaa atggagaggc tggagagata ggtaagacat tggttttgga 900 atctataaat aacaatgaca acaaatatta taaaatcgaa gatatatgct cccaagacgc 960 tttaatacag aaaataaata ataggttggg cgtcacattg acgtttacag aactaatcca 1020 atattacaac gaaggaaatt caattgttgc tcatgaaatt aaacaattta ttaataaaat 1080 gacagttctg attcataatt tgaatacaca atttaaccca gacgctattt atattaactg 1140 tcctttaatt aatgaattac caaatatttt aaatgaaatt aaagagcaat tctcctgttt 1200 ttctcaaggc agtccaattc aattacattt aactactaat gtaaaacaag ctactttatt 1260 gggtggcact ttagcaataa tgcaaaaaac attaaatata aataacattc aaatgaatat 1320 taaataatta cagcagtctg agttataaaa tagatatctc ggaccgtcat aaaaaattta 1380 tttgctttca ggaaaatttt tctgtataat agattcaagt tagtttgttt attaaattaa 1440 ccaactaaaa tgtagaattc gagctcggta cccggggatc ctctagagtc gacctgcagc 1500 caagcttggg cttttcagcc tgatacagat taaatcagaa cgcagaagcg gtctgataaa 1560 acagaatttg cctggcggca gtagcgcggt ggtcccacct gaccccatgc cgaactcaga 1620 agtgaaacgc cgtagcgccg atggtagtgt ggggtctccc catgcgagag tagggaactg 1680 ccaggcatca aataaaacga aaggctcagt cgaaagactg ggcctttcgt tttatctgtt 1740 gtttgtcggt gaacgctctc ctgagtagga caaatccgcc gggagcggat ttgaacgttg 1800 cgaagcaacg gcccggaggg tggcgggcag gacgcccgcc ataaactgcc aggcatcaaa 1860 ttaagcagaa ggccatcctg acggatggcc tttttgcgtt tctacaaact ctttttgttt 1920 atttttctaa atacattcaa atatgtatcc gctgatcccc atcctatcga tgataagctg 1980 tcaaacatga gaattaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 2040 caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 2100 gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt 2160 gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag 2220 ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct 2280 attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt 2340 gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc 2400 tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt 2460 agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg 2520 gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg 2580 actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct 2640 tgcccggcgt caacacggga taataccgcg ccacatagca gaactttaaa agtgctcatc 2700 attggaaaac gctcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt 2760 tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt 2820 tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg 2880 aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta tcagggttat 2940 tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg 3000 cgcacatttc cccgaaaagt gccacctggc ggccgcataa gatgatcttc ttgagatcgt 3060 tttggtctgc gcgtaatctc ttgctctgaa aacgaaaaaa ccgccttgca gggcggtttt 3120 tcgaaggttc tctgagctac caactctttg aaccgaggta actggcttgg aggagcgcag 3180 tcaccaaaac ttgtcctttc agtttagcct taaccggcgc atgacttcaa gactaactcc 3240 tctaaatcaa ttaccagtgg ctgctgccag tggtgctttt gcatgtcttt ccgggttgga 3300 ctcaagacga tagttaccgg ataaggcgca gcggtcggac tgaacggggg gttcgtgcat 3360 acagtccagc ttggagcgaa ctgcctaccc ggaactgagt gtcaggcgtg gaatgagaca 3420 aacgcggcca taacagcgga atgacaccgg taaaccgaaa ggcaggaaca ggagagcgca 3480 cgagggagcc gccaggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacca 3540 ctgatttgag cgtcagattt cgtgatgctt gtcagggggg cggagcctat ggaaaaacgg 3600 ctttgccgcg gccctctcac ttccctgtta agtatcttcc tggcatcttc caggaaatct 3660 ccgccccgtt cgtaagccat ttccgctcgc cgcagtcgaa cgaccgagcg tagcgagtca 3720 gtgagcgagg aagcggaata tatcctgtat cacatattct gctgacgcac cggtgcagcc 3780 ttttttctcc tgccacatga agcacttcac tgacaccctc atcagtgcca acatagtaag 3840 ccagtgcggc cgcttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc 3900 ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag 3960 cgaggaagcg gaagagcgcc tgatgcggta ttttctcctt acgcatctgt gcggtatttc 4020 acaccgcata agatcccctc gacctgcagg catgcaagct tctgtaggtt tttaggcata 4080 aaactatatg atttacccct aaatctttaa aatgcccctt aaaattcaaa ataaaggcat 4140 ttaaaattta aatatttctt gtgataaagt ttgttaaaaa ggagtggttt tatgactgtt 4200 atgtggttat cgattatagg tatgtggttt tgtattggaa tggcattttt tgctatcaag 4260 gttattaaaa ataaaaatta gaccacgcat ttatgccgag aaaatttatt gtgcgttgag 4320 aagaaccctt aactaaactt gcagacgaat gtcggcatag cgtgagctat taagccgacc 4380 attcgacaag ttttgggatt gttaagggtt ccgaggctca acgtcaataa agcaattgga 4440 ataaagaagc gaaaaaggag aagtcggttc agaaaaagaa ggatatggat ctggagctgt 4500 aatataaaaa ccttcttcaa ctaacggggc aggttagtga cattagaaaa ccgactgtaa 4560 aaagtacagt cggcattatc tcatattata aaagccagtc attaggccta tctgacaatt 4620 cctgaataga gttcataaac aatcctgcat gataaccatc acaaacagaa tgatgtacct 4680 gtaaagatag cggtaaatat attgaattac ctttattaat gaattttcct gctgtaataa 4740 tgggtagaag gtaattacta ttattattga tatttaagtt aaacccagta aatgaagtcc 4800 atggaataat agaaagagaa aaagcatttt caggtatagg tgttttggga aacaatttcc 4860 ccgaaccatt atatttctct acatcagaaa ggtataaatc ataaaactct ttgaagtcat 4920 tctttacagg agtccaaata ccagagaatg ttttagatac accatcaaaa attgtataaa 4980 gtggctctaa cttatcccaa taacctaact ctccgtcgct attgtaacca gttctaaaag 5040 ctgtatttga gtttatcacc cttgtcacta agaaaataaa tgcagggtaa aatttatatc 5100 cttcttgttt tatgtttcgg tataaaacac taatatcaat ttctgtggtt atactaaaag 5160 tcgtttgttg gttcaaataa tgattaaata tctcttttct cttccaattg tctaaatcaa 5220 ttttattaaa gttcatttga tatgcctcct aaatttttat ctaaagtgaa tttaggaggc 5280 ttacttgtct gctttcttca ttagaatcaa tcctttttta aaagtcaata ttactgtaac 5340 ataaatatat attttaaaaa tatcccactt tatccaattt tcgtttgttg aactaatggg 5400 tgctttagtt gaagaataaa gaccacatta aaaaatgtgg tcttttgtgt ttttttaaag 5460 gatttgagcg tagcgaaaaa tccttttctt tcttatcttg ataataaggg taactattgc 5520 cggcgaggct agttaccctt aagttattgg tatgactggt tttaagcgca aaaaaagttg 5580 ctttttcgta cctattaatg tatcgtttta aatgactagt aaaaaacata catagaaagg 5640 ggaaaaagca acttttttta ttgtcatagt ttgtgaaaac taagttgttt ttatgtgtta 5700 taacatggaa aagtatactg agaaaaaaca aagaaatcaa gtatttcaga aatttattaa 5760 acgtcatatt ggagagaatc aaatggattt agttgaagat tgcaatacat ttctgtcttt 5820 tgtagctgat aaaactttag aaaaacagaa attatataaa gctaattctt gtaaaaatcg 5880 attttgtcct gtctgtgctt ggagaaaagc tagaaaagat gcattgggtt tatctttgat 5940 gatgcaatat attaagcagc aagagaaaaa ggagtttatc tttttaactt tgactacacc 6000 taatgtaatg agtgatgaat tagaaaatga aataaaacgt tataataatt cttttagaaa 6060 acttataaag agaaaaaaag taggtagtgt tataaaggga tatgttcgta agttagagat 6120 tacatataat aaaaaaagag atgattataa tcctcatttt catgtgttaa ttgcagtaaa 6180 taaatcgtat ttcacagata aaagatatta tattagccaa caagaatggt tagatttatg 6240 gcgtgatgta acgggcattt cagaaataac acaagttcaa gttcaaaaaa taagacaaaa 6300 taataataaa gaattatatg aaatggctaa gtattctggt aaagatagtg attatttaat 6360 aaataaatca aaaagtcttt gatgcatttt ataaatcact taaaggtaaa caggtattag 6420 tttattcagg attatttaaa gaggctaaaa agaaattaaa aaatggggat ttagattact 6480 taaaagaaat tgatccaacc gtaatatatc tatcaaattt tttatatttg gaaacaaaaa 6540 gagtatttag ctagtgaact ttatgactta acagaacaag aaaaaagaga aattaatcac 6600 aaaatgatag acgaaatcga ggaagaacaa taacaaaata taagtgctaa cagtcgtctg 6660 caagtttagt taagggttct tctcaacgca caataaattt tctcggcata aatgcgtggt 6720 ctaattttta tttttaataa ccttgatagc aaaaaatgcc attccaatac aaaaccacat 6780 acctataatc gataaccaca taacagtcat aaaaccactc ctttttaaca aactttatca 6840 caagaaatat tt 6852 17 9437 DNA Artificial Sequence Vector pEPEF3 17 caacccagtc agctccttcc ggtgggcgcg gggcatgact atcgtcgccg cacttatgac 60 tgtcttcttt atcatgcaac tcgtaggaca ggtgccggca gcgctctggg tcattttcgg 120 cgaggaccgc tttcgctgga gcgcgacgat gatcggcctg tcgcttgcgg tattcggaat 180 cttgcacgcc ctcgctcaag ccttcgtcac tggtcccgcc accaaacgtt tcggcgagaa 240 gcaggccatt atcgccggca tggcggccga cgcgctgggc tacgtcttgc tggcgttcgc 300 gacgcgaggc tggatggcct tccccattat gattcttctc gcttccggcg gcatcgggat 360 gcccgcgttg caggccatgc tgtccaggca ggtagatgac gaccatcagg gacagcttca 420 aggatcgctc gcggctctta ccagcctaac ttcgatcact ggaccgctga tcgtcacggc 480 gatttatgcc gcctcggcga gcacatggaa cgggttggca tggattgtag gcgccgccct 540 ataccttgtc tgcctccccg cgttgcgtcg cggtgcatgg agccgggcca cctcgacctg 600 aatggaagcc ggcggcacct cgctaacgga ttcaccactc caagaattgg agccaatcaa 660 ttcttgcgga gaactgtgaa tgcgcaaacc aacccttggc agaacatatc catcgcgtcc 720 gccatctcca gcagccgcac gcggcgcatc tcgggcagcg ttgggtcctg gccacgggtg 780 cgcatgatcg tgctcctgtc gttgaggacc cggctaggct ggcggggttg ccttactggt 840 tagcagaatg aatcaccgat acgcgagcga acgtgaagcg actgctgctg caaaacgtct 900 gcgacctgag caacaacatg aatggtcttc ggtttccgtg tttcgtaaag tctggaaacg 960 cggaagtccc ctacgtgctg ctgaagttgc ccgcaacaga gagtggaacc aaccggtgat 1020 accacgatac tatgactgag agtcaacgcc atgagcggcc tcatttctta ttctgagtta 1080 caacagtccg caccgctgtc cggtagctcc ttccggtggg cgcggggcat gactatcgtc 1140 gccgcactta tgactgtctt ctttatcatg caactcgtag gacaggtgcc ggcagcgccc 1200 aacagtcccc cggccacggg gcctgccacc atacccacgc cgaaacaagc gccctgcacc 1260 attatgttcc ggatctgcat cgcaggatgc tgctggctac cctgtggaac acctacatct 1320 gtattaacga agcgctaacc gtttttatca ggctctggga ggcagaataa atgatcatat 1380 cgtcaattat tacctccacg gggagagcct gagcaaactg gcctcaggca tttgagaagc 1440 acacggtcac actgcttccg gtagtcaata aaccggtaaa ccagcaatag acataagcgg 1500 ctatttaacg accctgccct gaaccgacga ccgggtcgaa tttgctttcg aatttctgcc 1560 attcatccgc ttattatcac ttattcaggc gtagcaccag gcgtttaagg gcaccaataa 1620 ctgccttaaa aaaattacgc cccgccctgc cactcatcgc agtactgttg taattcatta 1680 agcattctgc cgacatggaa gccatcacag acggcatgat gaacctgaat cgccagcggc 1740 atcagcacct tgtcgccttg cgtataatat ttgcccatgg tgaaaacggg ggcgaagaag 1800 ttgtccatat tggccacgtt taaatcaaaa ctggtgaaac tcacccaggg attggctgag 1860 acgaaaaaca tattctcaat aaacccttta gggaaatagg ccaggttttc accgtaacac 1920 gccacatctt gcgaatatat gtgtagaaac tgccggaaat cgtcgtggta ttcactccag 1980 agcgatgaaa acgtttcagt ttgctcatgg aaaacggtgt aacaagggtg aacactatcc 2040 catatcacca gctcaccgtc tttcattgcc atacgaattc agaggtttga tgactttgac 2100 ctttggtttt aaatttttga aaaaaataaa aaaaggcgaa gcctattata tatttatcat 2160 atatatttta atcttttgtt cttttgcgtg aaaaaaaagg cagtgttttc gctagttata 2220 gaaattaaac agtcacaaaa atcgatgtat acagtcacaa aaatcgattt ttgtgactct 2280 atgcatcgat gtatacgagt cacaaaaatc gatgtacaca gtacgacttt tgtatttgtg 2340 tactgtatat agtataataa aagcatagag aaaactcact atgaaatgac tttctctatg 2400 ctactactaa aacacgcaaa ggagcgtatt tatactatga ttataccaga aaaacaaaac 2460 aaacaaaagc aggtgctgac cttgaatgaa ttagaaaaac gcaaagtagt ggagcataac 2520 gctttaattc aaagtgtggc taaaatgcaa aaaacagctt taaaaatgtt tgagttagcc 2580 gtgtcttgta ttgataccga agaaccccca aaaaataata ctgtttatct tttaaaatct 2640 gaacttttta aattttttga agtatcgtct agcagtaagc atagtcaatt caaagaagca 2700 gttaattata tgcaaaagca agcctttttc aatattaaag ctgataagaa actaggtatt 2760 gagtatgaaa gcattgtgcc tattccttat gtaaaatgga acgattataa cgatgaagta 2820 acaatacgtt ttgaccaagc cattatgccc tatctgattg atttaaaggc agaatttact 2880 caatataaaa tttccgaact acaaaaactc aatagcaagt attcgattat tttgtatcgt 2940 tggttatcca tgaattataa ccaatacgaa cattacagcg tgaaaggggg acggagagtt 3000 gaacaagtgg aatcctaccg caacccctca ataaaagtga aagaattgcg agaaataact 3060 gatacaataa atgaacatca acattttccc cattttgaaa ctagagtatt aaaaaaagca 3120 attgaagaaa tcaacgctca cacctctttt aatgtaacct atgagaaaaa gaaagcaggg 3180 cggagcattg actctattgt ctttcatatt gagaagaaac gcatggcaga cgataacagt 3240 tacaagttgg gagataaaga ctatcaagac gataaaaagc aaaaatcaag aaatgaagct 3300 gacttactaa aacaggcaat ggaaagtaaa tatacaagac ttttatctga aaatttctta 3360 attggcatga atgatattat ggatacagcc acaatggtag gcttacagaa gaatgtttat 3420 ccgctttatg acgaactgaa agaattaaga gggctaaatg gtgtcaaaga ccacttgtct 3480 tatgttgcca gcaaaaaaga agcctattct aaacgtaatg tagcgaaata tcttaaaaaa 3540 gccattgaac aatatctgcc aacggttaaa aggcaggact taaaccatga gtgaagactt 3600 aaaaacgata aaagagttgg cggacgagct aggactagct aaacagaccg ttcaatacca 3660 tttaaagttt ttgccgaccg aaaatcgaca aaaaaacaat agtggttcga tcgtcctaaa 3720 ccttgaagaa caaaagttta tcaaaaaaag actaacaaaa ataaaaaaag ttcaaccgac 3780 cgaaaatcga caagaaaaaa tgaagaatta agcgttaata gtcttttgtt aggaattccg 3840 gatgagcatt catcaggcgg gcaagaatgt gaataaaggc cggataaaac ttgtgcttat 3900 ttttctttac ggtctttaaa aaggccgtaa tatccagctg aacggtctgg ttataggtac 3960 attgagcaac tgactgaaat gcctcaaaat gttctttacg atgccattgg gatatatcaa 4020 cggtggtata tccagtgatt tttttctcca ttttagcttc cttagctcct gaaaatctcg 4080 ataactcaaa aaatacgccc ggtagtgatc ttatttcatt atggtgaaag ttggaacctc 4140 ttacgtgccg atcaacgtct cattttcgcc aaaagttggc ccagggcttc ccggtatcaa 4200 cagggacacc aggatttatt tattctgcga agtgatcttc cgtcacaggt atttattcgg 4260 cgcaaagtgc gtcgggtgat gctgccaact tactgattta gtgtatgatg gtgtttttga 4320 ggtgctccag tggcttctgt ttctatcagc tgtccctcct gttcagctac tgacggggtg 4380 gtgcgtaacg gcaaaagcac cgccggacat cagcgctagc ggagtgtata ctggcttact 4440 atgttggcac tgatgagggt gtcagtgaag tgcttcatgt ggcaggagaa aaaaggctgc 4500 accggtgcgt cagcagaata tgtgatacag gatatattcc gcttcctcgc tcactgactc 4560 gctacgctcg gtcgttcgac tgcggcgagc ggaaatggct tacgaacggg gcggagattt 4620 cctggaagat gccaggaaga tacttaacag ggaagtgaga gggccgcggc aaagccgttt 4680 ttccataggc tccgcccccc tgacaagcat cacgaaatct gacgctcaaa tcagtggtgg 4740 cgaaacccga caggactata aagataccag gcgtttcccc ctggcggctc cctcgtgcgc 4800 tctcctgttc ctgcctttcg gtttaccggt gtcattccgc tgttatggcc gcgtttgtct 4860 cattccacgc ctgacactca gttccgggta ggcagttcgc tccaagctgg actgtatgca 4920 cgaacccccc gttcagtccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa 4980 cccggaaaga catgcaaaag caccactggc agcagccact ggtaattgat ttagaggagt 5040 tagtcttgaa gtcatgcgcc ggttaaggct aaactgaaag gacaagtttt ggtgactgcg 5100 ctcctccaag ccagttacct cggttcaaag agttggtagc tcagagaacc ttcgaaaaac 5160 cgccctgcaa ggcggttttt tcgttttcag agcaagagat tacgcgcaga ccaaaacgat 5220 ctcaagaaga tcatcttatt aatcagataa aatatttcta gatttcagtg caatttatct 5280 cttcaaatgt agcacctgaa gtcagcccca tacgatataa gttgtaattc tcatgtttga 5340 cagcttatca tcgatacaaa ttccccgtag gcgctaggga cctctttagc ttcttggaag 5400 ctgtcagtag tatatctaat aatttatctc cattcccttt agtaacgtgt aactttccaa 5460 atttaaaaaa gcgactcata gaattatttc ctcccgttaa ataatagata actattaaaa 5520 atagacaata cttgctcata agtaatggta cttaaattgt ttactttggc gtgtttcatt 5580 gcttgatgaa actgattttt agtaaacagt tgacgatatt ctcgattgac ccattttgaa 5640 acaaagtacg tatatagctt ccaatattta tctggaacat ctgtggtatg gcgggtaagt 5700 tttattaaga cactgtttac ttttggttta ggatgaaagc attccgctgg cagcttaagc 5760 aattgctgaa tcgagacttg agtgtgcaag agcaacccta gtgttcggtg aatatccaag 5820 gtacgcttgt agaatccttc ttcaacaatc agatagatgt cagacgcacg gctttcaaaa 5880 accacttttt taataatttg tgtgcttaaa tggtaaggaa tattcccaac aattttatac 5940 ctctgtttgt tagggaattg aaactgtaga atatcttggt gaattaaagt gacacgagta 6000 ttcagtttta atttttctga cgataagttg aatagatgac tgtctaattc aatagacgtt 6060 acctgtttac ttattttagc cagtttcgtc gttaaatgcc ctttacctgt tccaatttcg 6120 taaacggtat cggtttcttt taaattcaat tgttttatta tttggttgag tactttttca 6180 ctcgttaaaa agttttgaga atattttata tttttgttca tgtaatctct cctgaagtga 6240 ttacatctat aaacaaatac agaagttaaa cgatttgttt gtaattttag ttatctgttt 6300 aaaaagtcat aagattagtc actggtagga attaatctaa cgtatttatt tatctgcgta 6360 atcactgttt ttagtctgtt tcaaaacagt agatgtttta tctacattac gcatttggaa 6420 taccaacatg acgaatccct ccttcttaat tacaaatttt tagcatctaa tttaacttca 6480 attcctatta tacaaaattt taagataatg cactatcaac acactcttaa gtttgcttct 6540 aagtcttatt tccataactt cttttacgtt tccgccattc tttgctgttt cgatttttat 6600 gatatggtgc aagtcagcac gaacacgaac cgtcttatct cccattatat ctttttttgc 6660 actgattggt gtatcatttc gtttttcttt ttgtgcgcct aaatttccca caatcactca 6720 cttctttcta tttcttctta ttcttatttt atcatcaaca atcacaaatc acttgtgatt 6780 tgtgataagt gatttgtgat aagtgatttg tgattaatat ataaaagccc tctttaaagg 6840 gcttttatgt ttattttgag aaagatataa aatcaatata tcccttttcc ccaattttta 6900 caacggcatt gtagggcttt ttctctttgt ttttgatttc cttttaccag ggtttctttt 6960 ccctccagta attcttttac attcgttttg gttagttttt tctttctaaa atgttcagct 7020 aaagtaaact tacattcagg ataatttgaa caaccataaa acgatttttt taatacaata 7080 ttgttgccac acttaggaca ttttcctaca atactttttt ctgcttcttt ttctttctgt 7140 tcctggtaat cagaaaaatt tagtttttct atatcgttag gtacagcttc cagtaaatga 7200 acaatgaatt ttttgatatt cgtaataaag ttctcttgat tgccttctct tttaccgatt 7260 ttttttaaat acgtttccca tttagccgtc atttcagcac tcgttaaaag gtgctgactt 7320 tcaactgcct ggcacaataa ttttcctttt tcagttacaa caagcttgcc tgcaggtcga 7380 cgataaccac ataacagtca taaaaccact cctttttaac aaactttatc acaagaaata 7440 ttttggcatt ctacgactat aacttaaatt tatatttttt actttataat atataattga 7500 ttatagaata atgttgctca tatcgtttgc caacatctag tactcaaatt acactatgtt 7560 acacttggta atattaaccg aacttcccct gtccaaatta gataagaggt aataataaat 7620 ggaaaataat tttatagtaa atgaaaatga gaagcgtgta ttaaaacaaa ttttcaataa 7680 cagcaatatt tcacgaacac aaatatcgaa gaatttagaa cttaataaag ctactatttc 7740 taacattctg aacaacttaa aacacaagag tttagttaat gaagtaggag aaggtaatag 7800 tactaaaagt ggtggacgaa agcctatttt actcgaaatt aaccaaaaat atggctacta 7860 tatttctatg gatttaacat atgattccgt tgaattaatg tacaactact ttgatgctac 7920 tatattaaag caagattcct acgaattaaa tgataaaaat gtaagcagta tattacaaat 7980 tttaaaatct aatataaacg tctcagaaaa atatgatacg ttatatgggt tacttggtat 8040 atctatatcc atacacggta tcgttgacga tgagcaaaac ataatcaatc ttccttttca 8100 taaaaatgag aaacgcacat ttaccgatga attaaagtca ttcacaaatg ttcctgtcgt 8160 tatagaaaat gaagcaaatt tatcagcgct atatgaaaaa agtttatata ttaattcaaa 8220 cataaataat ttgattactt taagtattca caagggtata ggcgctggca tcataataaa 8280 taaaaaactt tatcgtggct caaatggaga ggctggagag ataggtaaga cattggtttt 8340 ggaatctata aataacaatg acaacaaata ttataaaatc gaagatatat gctcccaaga 8400 cgctttaata cagaaaataa ataataggtt gggcgtcaca ttgacgttta cagaactaat 8460 ccaatattac aacgaaggaa attcaattgt tgctcatgaa attaaacaat ttattaataa 8520 aatgacagtt ctgattcata atttgaatac acaatttaac ccagacgcta tttatattaa 8580 ctgtccttta attaatgaat taccaaatat tttaaatgaa attaaagagc aattctcctg 8640 tttttctcaa ggcagtccaa ttcaattaca tttaactact aatgtaaaac aagctacttt 8700 attgggtggc actttagcaa taatgcaaaa aacattaaat ataaataaca ttcaaatgaa 8760 tattaaataa ttacagcagt ctgagttata aaatagatat ctcggaccgc tttggcagtt 8820 tattcttgac atgttagtga gggggctggt ataatcacat agtactgttc tcgagagtta 8880 gtttgtttat taaattaacc aactggatcc gaattcccgg ggatccgtcg acctgcagcc 8940 aagcttgggc ttttcagcct gatacagatt aaatcagaac gcagaagcgg tctgataaaa 9000 cagaatttgc ctggcggcag tagcgcggtg gtcccacctg accccatgcc gaactcagaa 9060 gtgaaacgcc gtagcgccga tggtagtgtg gggtctcccc atgcgagagt agggaactgc 9120 caggcatcaa ataaaacgaa aggctcagtc gaaagactgg gcctttcgtt ttatctgttg 9180 tttgtcggtg aacgctctcc tgagtaggac aaatccgccg ggagcggatt tgaacgttgc 9240 gaagcaacgg cccggagggt ggcgggcagg acgcccgcca taaactgcca ggcatcaaat 9300 taagcagaag gccatcctga cggatggcct ttttgcgttt ctacaaactc tttttgttta 9360 tttttctaaa tacattcaaa tatgtatccg ctgatcccca tcctatcgga tcctcgaccg 9420 atgcccttga gagcctt 9437 18 59 DNA Artificial Sequence Oligonucleotide PL10 18 gaccgtttcg tgaacttttt tgttgacaaa gataaaaaca catgatataa ttaaatcac 59 19 60 DNA Artificial Sequence Oligonucleotide PL10 complement 19 tcgagtgatt taattatatc atgtgttttt atctttgtca acaaaaaagt tcacgaaacg 60 20 64 DNA Artificial Sequence Oligonucleotide P59* 20 gaccgaaaaa tgacagttta ttcttgacag ggagagatag gtttgatata atataatagt 60 tgtc 64 21 65 DNA Artificial Sequence Oligonucleotide P59* complement 21 tcgagacaac tattatatta tatcaaacct atctctccct gtcaagaata aactgtcatt 60 tttcg 65 22 94 DNA Artificial Sequence Oligonucleotide Xyl-T5-DD 22 gaccgtcata aaaaatttat tttacatcag gaaaattttt ctgtatatta gattcaagtt 60 agtttgttta ttaaattaac caactaaaat gtag 94 23 95 DNA Artificial Sequence Oligonucleotide Xyl-T5-DD complement 23 aattctacat tttagttggt taatttaata aacaaactaa cttgaatcta atatacagaa 60 aattttcct gatgtaaaat aaatttttta tgacg 95 24 23 DNA Artificial Sequence pXyl-T5F primer 24 cagcagtctg agttataaaa tag 23 25 18 DNA Artificial Sequence LexL primer 25 tgttttatca gaccgctt 18 26 80 DNA Artificial Sequence Xyl-T5 fusion sequence 26 tcataaaaaa tttatttgct ttcaggaaaa tttttctgta taatagattc aagttagttt 60 gtttattaaa ttaaccaact 80 27 80 DNA Artificial Sequence Xyl-T5-DD fusion sequence 27 tcataaaaaa tttattttac atcaggaaaa tttttctgta tattagattc aagttagttt 60 gtttattaaa ttaaccaact 80 28 95 DNA Artificial Sequence CP25 fusion sequence 28 ctttggcagt ttattcttga catgttagtg agggggctgg tataatcaca tagtactgtt 60 ctcgagagtt agtttgttta ttaaattaac caact 95 29 86 DNA Artificial Sequence P32 fusion sequence 29 gtgagcttgg actagaaaaa aacttcacaa aatgctatac taggtaggta actcgagagt 60 tagtttgttt attaaattaa ccaact 86 30 86 DNA Artificial Sequence P59 fusion sequence 30 acattaaatt cttgacaggg agagataggt ttgatagaat ataatagttg tctcgagagt 60 tagtttgttt attaaattaa ccaact 86 31 93 DNA Artificial Sequence P59* fusion sequence 31 aaaaatgaca gtttattctt gacagggaga gataggtttg atataatata atagttgtct 60 cgagagttag tttgtttatt aaattaacca act 93 32 150 DNA Artificial Sequence P1P2 fusion sequence 32 gtctagttat tcttgacaaa aaataatatt ttgatataat taaatagttg tcgtttgata 60 acacagttta ttcttgacaa aaaaatataa aagtgtataa tagaaaagta ctgttctcga 120 gagttagttt gtttattaaa ttaaccaact 150 33 242 DNA Artificial Sequence P1P2* fusion sequence 33 34 88 DNA Artificial Sequence PL fusion sequence 34 tttcgtgaac ttttttgttg acaaagataa aaacacatga tatacttaaa tcactcgaga 60 gttagtttgt ttattaaatt aaccaact 88 35 88 DNA Artificial Sequence PL10 fusion sequence 35 tttcgtgaac ttttttgttg acaaagataa aaacacatga tataattaaa tcactcgaga 60 gttagtttgt ttattaaatt aaccaact 88 36 51 DNA Artificial Sequence Xyl-T5 promoter sequence 36 tcataaaaaa tttatttgct ttcaggaaaa tttttctgta taatagattc a 51 37 51 DNA Artificial Sequence Xyl-T5-DD promoter sequence 37 tcataaaaaa tttattttac atcaggaaaa tttttctgta tattagattc a 51 38 60 DNA Artificial Sequence CP25 promoter sequence 38 ctttggcagt ttattcttga catgttagtg agggggctgg tataatcaca tagtactgtt 60 39 51 DNA Artificial Sequence P32 promoter sequence 39 gtgagcttgg actagaaaaa aacttcacaa aatgctatac taggtaggta a 51 40 51 DNA Artificial Sequence P59 promoter sequence 40 acattaaatt cttgacaggg agagataggt ttgatagaat ataatagttg t 51 41 58 DNA Artificial Sequence P59* promoter sequence 41 aaaaatgaca gtttattctt gacagggaga gataggtttg atataatata atagttgt 58 42 115 DNA Artificial Sequence P1P2 promoter sequence 42 gtctagttat tcttgacaaa aaataatatt ttgatataat taaatagttg tcgtttgata 60 acacagttta ttcttgacaa aaaaatataa aagtgtataa tagaaaagta ctgtt 115 43 207 DNA Artificial Sequence P1P2* promoter sequence 43 aaacagccga ataaggctgt ttttcttttt tttatgtttt agaataagtg gtctagttta 60 ttcttgacaa aaaataatat tttgatataa ttaaatagtt gtcgtttgag acgactgact 120 ttcttattat tcatctaaaa tattattttg aaaagataac acagttattc ttgacaaaaa 180 aatataaaag tgtataatag aaaagta 207 44 43 DNA Artificial Sequence PL promoter sequence 44 ttttttgttg acaaagataa aaacacatga tatacttaaa tca 43 45 53 DNA Artificial Sequence PL10 promoter sequence 45 tttcgtgaac ttttttgttg acaaagataa aaacacatga tataattaaa tca 53 46 9425 DNA Artificial Sequence Vector pEPEF9 46 acccagtcag ctccttccgg tgggcgcggg gcatgactat cgtcgccgca cttatgactg 60 tcttctttat catgcaactc gtaggacagg tgccggcagc gctctgggtc attttcggcg 120 aggaccgctt tcgctggagc gcgacgatga tcggcctgtc gcttgcggta ttcggaatct 180 tgcacgccct cgctcaagcc ttcgtcactg gtcccgccac caaacgtttc ggcgagaagc 240 aggccattat cgccggcatg gcggccgacg cgctgggcta cgtcttgctg gcgttcgcga 300 cgcgaggctg gatggccttc cccattatga ttcttctcgc ttccggcggc atcgggatgc 360 ccgcgttgca ggccatgctg tccaggcagg tagatgacga ccatcaggga cagcttcaag 420 gatcgctcgc ggctcttacc agcctaactt cgatcactgg accgctgatc gtcacggcga 480 tttatgccgc ctcggcgagc acatggaacg ggttggcatg gattgtaggc gccgccctat 540 accttgtctg cctccccgcg ttgcgtcgcg gtgcatggag ccgggccacc tcgacctgaa 600 tggaagccgg cggcacctcg ctaacggatt caccactcca agaattggag ccaatcaatt 660 cttgcggaga actgtgaatg cgcaaaccaa cccttggcag aacatatcca tcgcgtccgc 720 catctccagc agccgcacgc ggcgcatctc gggcagcgtt gggtcctggc cacgggtgcg 780 catgatcgtg ctcctgtcgt tgaggacccg gctaggctgg cggggttgcc ttactggtta 840 gcagaatgaa tcaccgatac gcgagcgaac gtgaagcgac tgctgctgca aaacgtctgc 900 gacctgagca acaacatgaa tggtcttcgg tttccgtgtt tcgtaaagtc tggaaacgcg 960 gaagtcccct acgtgctgct gaagttgccc gcaacagaga gtggaaccaa ccggtgatac 1020 cacgatacta tgactgagag tcaacgccat gagcggcctc atttcttatt ctgagttaca 1080 acagtccgca ccgctgtccg gtagctcctt ccggtgggcg cggggcatga ctatcgtcgc 1140 cgcacttatg actgtcttct ttatcatgca actcgtagga caggtgccgg cagcgcccaa 1200 cagtcccccg gccacggggc ctgccaccat acccacgccg aaacaagcgc cctgcaccat 1260 tatgttccgg atctgcatcg caggatgctg ctggctaccc tgtggaacac ctacatctgt 1320 attaacgaag cgctaaccgt ttttatcagg ctctgggagg cagaataaat gatcatatcg 1380 tcaattatta cctccacggg gagagcctga gcaaactggc ctcaggcatt tgagaagcac 1440 acggtcacac tgcttccggt agtcaataaa ccggtaaacc agcaatagac ataagcggct 1500 atttaacgac cctgccctga accgacgacc gggtcgaatt tgctttcgaa tttctgccat 1560 tcatccgctt attatcactt attcaggcgt agcaccaggc gtttaagggc accaataact 1620 gccttaaaaa aattacgccc cgccctgcca ctcatcgcag tactgttgta attcattaag 1680 cattctgccg acatggaagc catcacagac ggcatgatga acctgaatcg ccagcggcat 1740 cagcaccttg tcgccttgcg tataatattt gcccatggtg aaaacggggg cgaagaagtt 1800 gtccatattg gccacgttta aatcaaaact ggtgaaactc acccagggat tggctgagac 1860 gaaaaacata ttctcaataa accctttagg gaaataggcc aggttttcac cgtaacacgc 1920 cacatcttgc gaatatatgt gtagaaactg ccggaaatcg tcgtggtatt cactccagag 1980 cgatgaaaac gtttcagttt gctcatggaa aacggtgtaa caagggtgaa cactatccca 2040 tatcaccagc tcaccgtctt tcattgccat acgaattcag aggtttgatg actttgacct 2100 ttggttttaa atttttgaaa aaaataaaaa aaggcgaagc ctattatata tttatcatat 2160 atattttaat cttttgttct tttgcgtgaa aaaaaaggca gtgttttcgc tagttataga 2220 aattaaacag tcacaaaaat cgatgtatac agtcacaaaa atcgattttt gtgactctat 2280 gcatcgatgt atacgagtca caaaaatcga tgtacacagt acgacttttg tatttgtgta 2340 ctgtatatag tataataaaa gcatagagaa aactcactat gaaatgactt tctctatgct 2400 actactaaaa cacgcaaagg agcgtattta tactatgatt ataccagaaa aacaaaacaa 2460 acaaaagcag gtgctgacct tgaatgaatt agaaaaacgc aaagtagtgg agcataacgc 2520 tttaattcaa agtgtggcta aaatgcaaaa aacagcttta aaaatgtttg agttagccgt 2580 gtcttgtatt gataccgaag aacccccaaa aaataatact gtttatcttt taaaatctga 2640 actttttaaa ttttttgaag tatcgtctag cagtaagcat agtcaattca aagaagcagt 2700 taattatatg caaaagcaag cctttttcaa tattaaagct gataagaaac taggtattga 2760 gtatgaaagc attgtgccta ttccttatgt aaaatggaac gattataacg atgaagtaac 2820 aatacgtttt gaccaagcca ttatgcccta tctgattgat ttaaaggcag aatttactca 2880 atataaaatt tccgaactac aaaaactcaa tagcaagtat tcgattattt tgtatcgttg 2940 gttatccatg aattataacc aatacgaaca ttacagcgtg aaagggggac ggagagttga 3000 acaagtggaa tcctaccgca acccctcaat aaaagtgaaa gaattgcgag aaataactga 3060 tacaataaat gaacatcaac attttcccca ttttgaaact agagtattaa aaaaagcaat 3120 tgaagaaatc aacgctcaca cctcttttaa tgtaacctat gagaaaaaga aagcagggcg 3180 gagcattgac tctattgtct ttcatattga gaagaaacgc atggcagacg ataacagtta 3240 caagttggga gataaagact atcaagacga taaaaagcaa aaatcaagaa atgaagctga 3300 cttactaaaa caggcaatgg aaagtaaata tacaagactt ttatctgaaa atttcttaat 3360 tggcatgaat gatattatgg atacagccac aatggtaggc ttacagaaga atgtttatcc 3420 gctttatgac gaactgaaag aattaagagg gctaaatggt gtcaaagacc acttgtctta 3480 tgttgccagc aaaaaagaag cctattctaa acgtaatgta gcgaaatatc ttaaaaaagc 3540 cattgaacaa tatctgccaa cggttaaaag gcaggactta aaccatgagt gaagacttaa 3600 aaacgataaa agagttggcg gacgagctag gactagctaa acagaccgtt caataccatt 3660 taaagttttt gccgaccgaa aatcgacaaa aaaacaatag tggttcgatc gtcctaaacc 3720 ttgaagaaca aaagtttatc aaaaaaagac taacaaaaat aaaaaaagtt caaccgaccg 3780 aaaatcgaca agaaaaaatg aagaattaag cgttaatagt cttttgttag gaattccgga 3840 tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt gtgcttattt 3900 ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt ataggtacat 3960 tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga tatatcaacg 4020 gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga aaatctcgat 4080 aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt ggaacctctt 4140 acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc ggtatcaaca 4200 gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat ttattcggcg 4260 caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt gtttttgagg 4320 tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg acggggtggt 4380 gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtatact ggcttactat 4440 gttggcactg atgagggtgt cagtgaagtg cttcatgtgg caggagaaaa aaggctgcac 4500 cggtgcgtca gcagaatatg tgatacagga tatattccgc ttcctcgctc actgactcgc 4560 tacgctcggt cgttcgactg cggcgagcgg aaatggctta cgaacggggc ggagatttcc 4620 tggaagatgc caggaagata cttaacaggg aagtgagagg gccgcggcaa agccgttttt 4680 ccataggctc cgcccccctg acaagcatca cgaaatctga cgctcaaatc agtggtggcg 4740 aaacccgaca ggactataaa gataccaggc gtttccccct ggcggctccc tcgtgcgctc 4800 tcctgttcct gcctttcggt ttaccggtgt cattccgctg ttatggccgc gtttgtctca 4860 ttccacgcct gacactcagt tccgggtagg cagttcgctc caagctggac tgtatgcacg 4920 aaccccccgt tcagtccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc 4980 cggaaagaca tgcaaaagca ccactggcag cagccactgg taattgattt agaggagtta 5040 gtcttgaagt catgcgccgg ttaaggctaa actgaaagga caagttttgg tgactgcgct 5100 cctccaagcc agttacctcg gttcaaagag ttggtagctc agagaacctt cgaaaaaccg 5160 ccctgcaagg cggttttttc gttttcagag caagagatta cgcgcagacc aaaacgatct 5220 caagaagatc atcttattaa tcagataaaa tatttctaga tttcagtgca atttatctct 5280 tcaaatgtag cacctgaagt cagccccata cgatataagt tgtaattctc atgtttgaca 5340 gcttatcatc gatacaaatt ccccgtaggc gctagggacc tctttagctt cttggaagct 5400 gtcagtagta tatctaataa tttatctcca ttccctttag taacgtgtaa ctttccaaat 5460 ttaaaaaagc gactcataga attatttcct cccgttaaat aatagataac tattaaaaat 5520 agacaatact tgctcataag taatggtact taaattgttt actttggcgt gtttcattgc 5580 ttgatgaaac tgatttttag taaacagttg acgatattct cgattgaccc attttgaaac 5640 aaagtacgta tatagcttcc aatatttatc tggaacatct gtggtatggc gggtaagttt 5700 tattaagaca ctgtttactt ttggtttagg atgaaagcat tccgctggca gcttaagcaa 5760 ttgctgaatc gagacttgag tgtgcaagag caaccctagt gttcggtgaa tatccaaggt 5820 acgcttgtag aatccttctt caacaatcag atagatgtca gacgcacggc tttcaaaaac 5880 cactttttta ataatttgtg tgcttaaatg gtaaggaata ttcccaacaa ttttatacct 5940 ctgtttgtta gggaattgaa actgtagaat atcttggtga attaaagtga cacgagtatt 6000 cagttttaat ttttctgacg ataagttgaa tagatgactg tctaattcaa tagacgttac 6060 ctgtttactt attttagcca gtttcgtcgt taaatgccct ttacctgttc caatttcgta 6120 aacggtatcg gtttctttta aattcaattg ttttattatt tggttgagta ctttttcact 6180 cgttaaaaag ttttgagaat attttatatt tttgttcatg taatctctcc tgaagtgatt 6240 acatctataa acaaatacag aagttaaacg atttgtttgt aattttagtt atctgtttaa 6300 aaagtcataa gattagtcac tggtaggaat taatctaacg tatttattta tctgcgtaat 6360 cactgttttt agtctgtttc aaaacagtag atgttttatc tacattacgc atttggaata 6420 ccaacatgac gaatccctcc ttcttaatta caaattttta gcatctaatt taacttcaat 6480 tcctattata caaaatttta agataatgca ctatcaacac actcttaagt ttgcttctaa 6540 gtcttatttc cataacttct tttacgtttc cgccattctt tgctgtttcg atttttatga 6600 tatggtgcaa gtcagcacga acacgaaccg tcttatctcc cattatatct ttttttgcac 6660 tgattggtgt atcatttcgt ttttcttttt gtgcgcctaa atttcccaca atcactcact 6720 tctttctatt tcttcttatt cttattttat catcaacaat cacaaatcac ttgtgatttg 6780 tgataagtga tttgtgataa gtgatttgtg attaatatat aaaagccctc tttaaagggc 6840 ttttatgttt attttgagaa agatataaaa tcaatatatc ccttttcccc aatttttaca 6900 acggcattgt agggcttttt ctctttgttt ttgatttcct tttaccaggg tttcttttcc 6960 ctccagtaat tcttttacat tcgttttggt tagttttttc tttctaaaat gttcagctaa 7020 agtaaactta cattcaggat aatttgaaca accataaaac gattttttta atacaatatt 7080 gttgccacac ttaggacatt ttcctacaat acttttttct gcttcttttt ctttctgttc 7140 ctggtaatca gaaaaattta gtttttctat atcgttaggt acagcttcca gtaaatgaac 7200 aatgaatttt ttgatattcg taataaagtt ctcttgattg ccttctcttt taccgatttt 7260 ttttaaatac gtttcccatt tagccgtcat ttcagcactc gttaaaaggt gctgactttc 7320 aactgcctgg cacaataatt ttcctttttc agttacaaca agcttgcctg caggtcgacg 7380 ataaccacat aacagtcata aaaccactcc tttttaacaa actttatcac aagaaatatt 7440 ttggcattct acgactataa cttaaattta tattttttac tttataatat ataattgatt 7500 atagaataat gttgctcata tcgtttgcca acatctagta ctcaaattac actatgttac 7560 acttggtaat attaaccgaa cttcccctgt ccaaattaga taagaggtaa taataaatgg 7620 aaaataattt tatagtaaat gaaaatgaga agcgtgtatt aaaacaaatt ttcaataaca 7680 gcaatatttc acgaacacaa atatcgaaga atttagaact taataaagct actatttcta 7740 acattctgaa caacttaaaa cacaagagtt tagttaatga agtaggagaa ggtaatagta 7800 ctaaaagtgg tggacgaaag cctattttac tcgaaattaa ccaaaaatat ggctactata 7860 tttctatgga tttaacatat gattccgttg aattaatgta caactacttt gatgctacta 7920 tattaaagca agattcctac gaattaaatg ataaaaatgt aagcagtata ttacaaattt 7980 taaaatctaa tataaacgtc tcagaaaaat atgatacgtt atatgggtta cttggtatat 8040 ctatatccat acacggtatc gttgacgatg agcaaaacat aatcaatctt ccttttcata 8100 aaaatgagaa acgcacattt accgatgaat taaagtcatt cacaaatgtt cctgtcgtta 8160 tagaaaatga agcaaattta tcagcgctat atgaaaaaag tttatatatt aattcaaaca 8220 taaataattt gattacttta agtattcaca agggtatagg cgctggcatc ataataaata 8280 aaaaacttta tcgtggctca aatggagagg ctggagagat aggtaagaca ttggttttgg 8340 aatctataaa taacaatgac aacaaatatt ataaaatcga agatatatgc tcccaagacg 8400 ctttaataca gaaaataaat aataggttgg gcgtcacatt gacgtttaca gaactaatcc 8460 aatattacaa cgaaggaaat tcaattgttg ctcatgaaat taaacaattt attaataaaa 8520 tgacagttct gattcataat ttgaatacac aatttaaccc agacgctatt tatattaact 8580 gtcctttaat taatgaatta ccaaatattt taaatgaaat taaagagcaa ttctcctgtt 8640 tttctcaagg cagtccaatt caattacatt taactactaa tgtaaaacaa gctactttat 8700 tgggtggcac tttagcaata atgcaaaaaa cattaaatat aaataacatt caaatgaata 8760 ttaaataatt acagcagtct gagttataaa atagatatct cggaccggtg agcttggact 8820 agaaaaaaac ttcacaaaat gctatactag gtaggtaact cgagagttag tttgtttatt 8880 aaattaacca actggacgaa ttcccgggga tccgtcgacc tgcagccaag cttgggcttt 8940 tcagcctgat acagattaaa tcagaacgca gaagcggtct gataaaacag aatttgcctg 9000 gcggcagtag cgcggtggtc ccacctgacc ccatgccgaa ctcagaagtg aaacgccgta 9060 gcgccgatgg tagtgtgggg tctccccatg cgagagtagg gaactgccag gcatcaaata 9120 aaacgaaagg ctcagtcgaa agactgggcc tttcgtttta tctgttgttt gtcggtgaac 9180 gctctcctga gtaggacaaa tccgccggga gcggatttga acgttgcgaa gcaacggccc 9240 ggagggtggc gggcaggacg cccgccataa actgccaggc atcaaattaa gcagaaggcc 9300 atcctgacgg atggcctttt tgcgtttcta caaactcttt ttgtttattt ttctaaatac 9360 attcaaatat gtatccgctg atccccatcc tatcgatccc cacccgatgc ccttgagagc 9420 cttca 9425 47 12733 DNA Artificial Sequence Vector pEPEF14 47 aagcttgcct gcaggtcgac gataaccaca taacagtcat aaaaccactc ctttttaaca 60 aactttatca caagaaatat tttggcattc tacgactata acttaaattt atatttttta 120 ctttataata tataattgat tatagaataa tgttgctcat atcgtttgcc aacatctagt 180 actcaaatta cactatgtta cacttggtaa tattaaccga acttcccctg tccaaattag 240 ataagaggta ataataaatg gaaaataatt ttatagtaaa tgaaaatgag aagcgtgtat 300 taaaacaaat tttcaataac agcaatattt cacgaacaca aatatcgaag aatttagaac 360 ttaataaagc tactatttct aacattctga acaacttaaa acacaagagt ttagttaatg 420 aagtaggaga aggtaatagt actaaaagtg gtggacgaaa gcctatttta ctcgaaatta 480 accaaaaata tggctactat atttctatgg atttaacata tgattccgtt gaattaatgt 540 acaactactt tgatgctact atattaaagc aagattccta cgaattaaat gataaaaatg 600 taagcagtat attacaaatt ttaaaatcta atataaacgt ctcagaaaaa tatgatacgt 660 tatatgggtt acttggtata tctatatcca tacacggtat cgttgacgat gagcaaaaca 720 taatcaatct tccttttcat aaaaatgaga aacgcacatt taccgatgaa ttaaagtcat 780 tcacaaatgt tcctgtcgtt atagaaaatg aagcaaattt atcagcgcta tatgaaaaaa 840 gtttatatat taattcaaac ataaataatt tgattacttt aagtattcac aagggtatag 900 gcgctggcat cataataaat aaaaaacttt atcgtggctc aaatggagag gctggagaga 960 taggtaagac attggttttg gaatctataa ataacaatga caacaaatat tataaaatcg 1020 aagatatatg ctcccaagac gctttaatac agaaaataaa taataggttg ggcgtcacat 1080 tgacgtttac agaactaatc caatattaca acgaaggaaa ttcaattgtt gctcatgaaa 1140 ttaaacaatt tattaataaa atgacagttc tgattcataa tttgaataca caatttaacc 1200 cagacgctat ttatattaac tgtcctttaa ttaatgaatt accaaatatt ttaaatgaaa 1260 ttaaagagca attctcctgt ttttctcaag gcagtccaat tcaattacat ttaactacta 1320 atgtaaaaca agctacttta ttgggtggca ctttagcaat aatgcaaaaa acattaaata 1380 taaataacat tcaaatgaat attaaataat tacagcagtc tgagttataa aatagatatc 1440 tcggaccgac attaaattct tgacagggag agataggttt gatagaatat aatagttgtc 1500 tcgagagtta gtttgtttat taaattaacc aactggatcc cgggtctaga ttagggtaac 1560 tttgaaagga tattcctcat gcaagctaat cttcaatggt tagatgaccc agaagtcttc 1620 cgggtcaacc aattacctgc acatagtgat caccattatt atcacgacac agcagaattc 1680 aaaacgggta gtcgcttcat caagagtctc aatggcgctt ggcgttttaa cttcgccaag 1740 acaccggctg aacgcccagt tgatttttat caacccgatt tcgatgcaac cgactttgat 1800 acgattcaag ttcccggtca tattgaacta gccggctatg gtcaaattca atacattaac 1860 acgctatacc catgggaagg taaaatttac cgtcgcccac cgtataccct caatcaagat 1920 caattaacac caggcctatt cagcgacgct gcggacaaca ccgtcggctc gtacctcaaa 1980 accttcgatc tcgacgatgt ttttaaaggg caacgtatta tcattcagtt ccaaggggta 2040 gaagaagccc tgtacgtctg gttaaatggc cattttattg gctactctga agatagtttc 2100 accccttcag aatttgattt gacgccgtat attcaggacc aaggtaacgt tttagcggtt 2160 cgggtctaca aacacagtac tgctgccttt attgaagasc caagatatgt tccgtttctc 2220 tggtattttc cgtgacgtca atatactggc ggagcctgct agccatatta ctgatttgga 2280 catccgacca gttccaaatg ccaatctcaa aagtggtgag ctcaacatca ctactaaagt 2340 aaccggcgaa ccagccactt tagcgctgac cgttaaagac catgacgggc gagtactgac 2400 gagtcaaacg caaaccggta gtggcagtgt aacctttgat actatgttat tcgaccaact 2460 gcacttgtgg tcaccacaaa cgccgtatct ctatcaattg acaattgaag tttacgatgc 2520 tgatcaccaa ctcttggaag tcgtcccata tcagtttggg ttccggacgg tcgagctgcg 2580 cgatgacaaa gtcatttacg tcaacaataa acggttggtg atcaacgggg ttaaccggca 2640 cgaatggaac gcccacaccg gtcgcgttat cagtatggct gatatgcgcg ctgatatcca 2700 aaccatgtta gctaacaata tcaatgccga tcggacctgc cattatcctg atcaattacc 2760 ttggtatcaa ttatgtgacg aggccggtat ctacctaatg gccgaaacca acctcgaatc 2820 gcacgggtca tggcaaaaga tgggggctat cgagccttct tacaatgttc ctggcgataa 2880 tccacactgg ccagcagcgg tgatcgaccg ggcccgttca aactacgaat ggtttaaaaa 2940 ccacccatca atcatttttt ggtcacttgg caatgaatcg tatgctggcg aagatatcgc 3000 ggcgatgcag gctttttata aagaacacga tgattcacga ctcgtccact acgaaggcgt 3060 tttctacaca ccagaattaa aagatcgcat ttctgatgtt gaaagtcgga tgtacgaaaa 3120 gccccaaaat attgtagctt acttggaaga taacccaacc aaacctttcc taaattgtga 3180 atatatgcat gacatgggga attctctggg cggtatgcaa tcatataatg atttgatcga 3240 caagtatcca atgtatcaag gtggctttat ttgggacttt attgatcaag cactcttcgt 3300 tcatgaccca atcaccgacc aagacgtgct ccggtatggc ggtgatttcg acgaacgcca 3360 ctccgattat gcattctccg gtaacggctt aatgtttgcc gaccggacac caaaaccagc 3420 aatgcaagag gtgaaatatt attatggctt acacaaataa tcaactacac gttatttacg 3480 gcgacgggag cttaggacta cagggggcta atttccacta cctctttagc tacgaacgtg 3540 ggggacttga atcactcgtc gtcaacgata aagagtggct ctatcgtaca cccacgccca 3600 tgttttggcg ggcgacaacc gataatgatc acggtagcgg cttttcagtc aaatccgcac 3660 agtggtacgc ggccgataag ttctcaactt gtcaagatat cgaattgacg gttgacgacc 3720 aaccagtcac accgttacca atcgcgccac tcaataacaa atacacggat cacgaaatcg 3780 ccacgaaagt ctctctggct taccacttcg ttaccacgac cgttcctagt accatcgtca 3840 cagtgactta tacggtgaca gcagacggtc agatcaatat cgccacccat tatagcggtc 3900 agtctgattt gccagagcta cccgcatttg gtctgcggtt tatcatgcca actaccgcga 3960 ccggcttcga ctataccggt ttgtccggtg agacttatcc tgaccggctg gctggcgcaa 4020 cacacgggca attccacgtt gacagtctgc cagtcacacc atacttggtc ccacaagaat 4080 gcggcatgca catgcaaact gaacaagtga cagtaacgcg atcaacaaca caaaataacg 4140 ctgaccacga caacacacca ttcagtttaa catttagcca aaccgatgca ccattcgcct 4200 tcagctgcct tccctatacc gctgctgaac tagaaaacgc aacacacatg gaagaattac 4260 cattagcacg gcgaacggtc ttatcaatct acggtgccgt tcgtggggtc ggtggcattg 4320 acagttgggg aacggacgta gaagccccat atcatatcct cgctaatcaa gacattgact 4380 tcagctttaa cattcatttc taaaaattaa tttgatttca aaagaatcgc tccggcagtt 4440 atttgccaga gcgttctttt aatatgtttt atggctgagc tttagtcctt tgaagtgggg 4500 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4560 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4620 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4680 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4740 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4800 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4860 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4920 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 4980 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 5040 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 5100 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 5160 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 5220 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 5280 gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg gggggggggg 5340 ggggggggtc gaccgatgcc cttgagagcc ttcaacccag tcagctcctt ccggtgggcg 5400 cggggcatga ctatcgtcgc cgcacttatg actgtcttct ttatcatgca actcgtagga 5460 caggtgccgg cagcgctctg ggtcattttc ggcgaggacc gctttcgctg gagcgcgacg 5520 atgatcggcc tgtcgcttgc ggtattcgga atcttgcacg ccctcgctca agccttcgtc 5580 actggtcccg ccaccaaacg tttcggcgag aagcaggcca ttatcgccgg catggcggcc 5640 gacgcgctgg gctacgtctt gctggcgttc gcgacgcgag gctggatggc cttccccatt 5700 atgattcttc tcgcttccgg cggcatcggg atgcccgcgt tgcaggccat gctgtccagg 5760 caggtagatg acgaccatca gggacagctt caaggatcgc tcgcggctct taccagccta 5820 acttcgatca ctggaccgct gatcgtcacg gcgatttatg ccgcctcggc gagcacatgg 5880 aacgggttgg catggattgt aggcgccgcc ctataccttg tctgcctccc cgcgttgcgt 5940 cgcggtgcat ggagccgggc cacctcgacc tgaatggaag ccggcggcac ctcgctaacg 6000 gattcaccac tccaagaatt ggagccaatc aattcttgcg gagaactgtg aatgcgcaaa 6060 ccaacccttg gcagaacata tccatcgcgt ccgccatctc cagcagccgc acgcggcgca 6120 tctcgggcag cgttgggtcc tggccacggg tgcgcatgat cgtgctcctg tcgttgagga 6180 cccggctagg ctggcggggt tgccttactg gttagcagaa tgaatcaccg atacgcgagc 6240 gaacgtgaag cgactgctgc tgcaaaacgt ctgcgacctg agcaacaaca tgaatggtct 6300 tcggtttccg tgtttcgtaa agtctggaaa cgcggaagtc ccctacgtgc tgctgaagtt 6360 gcccgcaaca gagagtggaa ccaaccggtg ataccacgat actatgactg agagtcaacg 6420 ccatgagcgg cctcatttct tattctgagt tacaacagtc cgcaccgctg tccggtagct 6480 ccttccggtg ggcgcggggc atgactatcg tcgccgcact tatgactgtc ttctttatca 6540 tgcaactcgt aggacaggtg ccggcagcgc ccaacagtcc cccggccacg gggcctgcca 6600 ccatacccac gccgaaacaa gcgccctgca ccattatgtt ccggatctgc atcgcaggat 6660 gctgctggct accctgtgga acacctacat ctgtattaac gaagcgctaa ccgtttttat 6720 caggctctgg gaggcagaat aaatgatcat atcgtcaatt attacctcca cggggagagc 6780 ctgagcaaac tggcctcagg catttgagaa gcacacggtc acactgcttc cggtagtcaa 6840 taaaccggta aaccagcaat agacataagc ggctatttaa cgaccctgcc ctgaaccgac 6900 gaccgggtcg aatttgcttt cgaatttctg ccattcatcc gcttattatc acttattcag 6960 gcgtagcacc aggcgtttaa gggcaccaat aactgcctta aaaaaattac gccccgccct 7020 gccactcatc gcagtactgt tgtaattcat taagcattct gccgacatgg aagccatcac 7080 agacggcatg atgaacctga atcgccagcg gcatcagcac cttgtcgcct tgcgtataat 7140 atttgcccat ggtgaaaacg ggggcgaaga agttgtccat attggccacg tttaaatcaa 7200 aactggtgaa actcacccag ggattggctg agacgaaaaa catattctca ataaaccctt 7260 tagggaaata ggccaggttt tcaccgtaac acgccacatc ttgcgaatat atgtgtagaa 7320 actgccggaa atcgtcgtgg tattcactcc agagcgatga aaacgtttca gtttgctcat 7380 ggaaaacggt gtaacaaggg tgaacactat cccatatcac cagctcaccg tctttcattg 7440 ccatacgaat tcagaggttt gatgactttg acctttggtt ttaaattttt gaaaaaaata 7500 aaaaaaggcg aagcctatta tatatttatc atatatattt taatcttttg ttcttttgcg 7560 tgaaaaaaaa ggcagtgttt tcgctagtta tagaaattaa acagtcacaa aaatcgatgt 7620 atacagtcac aaaaatcgat ttttgtgact ctatgcatcg atgtatacga gtcacaaaaa 7680 tcgatgtaca cagtacgact tttgtatttg tgtactgtat atagtataat aaaagcatag 7740 agaaaactca ctatgaaatg actttctcta tgctactact aaaacacgca aaggagcgta 7800 tttatactat gattatacca gaaaaacaaa acaaacaaaa gcaggtgctg accttgaatg 7860 aattagaaaa acgcaaagta gtggagcata acgctttaat tcaaagtgtg gctaaaatgc 7920 aaaaaacagc tttaaaaatg tttgagttag ccgtgtcttg tattgatacc gaagaacccc 7980 caaaaaataa tactgtttat cttttaaaat ctgaactttt taaatttttt gaagtatcgt 8040 ctagcagtaa gcatagtcaa ttcaaagaag cagttaatta tatgcaaaag caagcctttt 8100 tcaatattaa agctgataag aaactaggta ttgagtatga aagcattgtg cctattcctt 8160 atgtaaaatg gaacgattat aacgatgaag taacaatacg ttttgaccaa gccattatgc 8220 cctatctgat tgatttaaag gcagaattta ctcaatataa aatttccgaa ctacaaaaac 8280 tcaatagcaa gtattcgatt attttgtatc gttggttatc catgaattat aaccaatacg 8340 aacattacag cgtgaaaggg ggacggagag ttgaacaagt ggaatcctac cgcaacccct 8400 caataaaagt gaaagaattg cgagaaataa ctgatacaat aaatgaacat caacattttc 8460 cccattttga aactagagta ttaaaaaaag caattgaaga aatcaacgct cacacctctt 8520 ttaatgtaac ctatgagaaa aagaaagcag ggcggagcat tgactctatt gtctttcata 8580 ttgagaagaa acgcatggca gacgataaca gttacaagtt gggagataaa gactatcaag 8640 acgataaaaa gcaaaaatca agaaatgaag ctgacttact aaaacaggca atggaaagta 8700 aatatacaag acttttatct gaaaatttct taattggcat gaatgatatt atggatacag 8760 ccacaatggt aggcttacag aagaatgttt atccgcttta tgacgaactg aaagaattaa 8820 gagggctaaa tggtgtcaaa gaccacttgt cttatgttgc cagcaaaaaa gaagcctatt 8880 ctaaacgtaa tgtagcgaaa tatcttaaaa aagccattga acaatatctg ccaacggtta 8940 aaaggcagga cttaaaccat gagtgaagac ttaaaaacga taaaagagtt ggcggacgag 9000 ctaggactag ctaaacagac cgttcaatac catttaaagt ttttgccgac cgaaaatcga 9060 caaaaaaaca atagtggttc gatcgtccta aaccttgaag aacaaaagtt tatcaaaaaa 9120 agactaacaa aaataaaaaa agttcaaccg accgaaaatc gacaagaaaa aatgaagaat 9180 taagcgttaa tagtcttttg ttaggaattc cggatgagca ttcatcaggc gggcaagaat 9240 gtgaataaag gccggataaa acttgtgctt atttttcttt acggtcttta aaaaggccgt 9300 aatatccagc tgaacggtct ggttataggt acattgagca actgactgaa atgcctcaaa 9360 atgttcttta cgatgccatt gggatatatc aacggtggta tatccagtga tttttttctc 9420 cattttagct tccttagctc ctgaaaatct cgataactca aaaaatacgc ccggtagtga 9480 tcttatttca ttatggtgaa agttggaacc tcttacgtgc cgatcaacgt ctcattttcg 9540 ccaaaagttg gcccagggct tcccggtatc aacagggaca ccaggattta tttattctgc 9600 gaagtgatct tccgtcacag gtatttattc ggcgcaaagt gcgtcgggtg atgctgccaa 9660 cttactgatt tagtgtatga tggtgttttt gaggtgctcc agtggcttct gtttctatca 9720 gctgtccctc ctgttcagct actgacgggg tggtgcgtaa cggcaaaagc accgccggac 9780 atcagcgcta gcggagtgta tactggctta ctatgttggc actgatgagg gtgtcagtga 9840 agtgcttcat gtggcaggag aaaaaaggct gcaccggtgc gtcagcagaa tatgtgatac 9900 aggatatatt ccgcttcctc gctcactgac tcgctacgct cggtcgttcg actgcggcga 9960 gcggaaatgg cttacgaacg gggcggagat ttcctggaag atgccaggaa gatacttaac 10020 agggaagtga gagggccgcg gcaaagccgt ttttccatag gctccgcccc cctgacaagc 10080 atcacgaaat ctgacgctca aatcagtggt ggcgaaaccc gacaggacta taaagatacc 10140 aggcgtttcc ccctggcggc tccctcgtgc gctctcctgt tcctgccttt cggtttaccg 10200 gtgtcattcc gctgttatgg ccgcgtttgt ctcattccac gcctgacact cagttccggg 10260 taggcagttc gctccaagct ggactgtatg cacgaacccc ccgttcagtc cgaccgctgc 10320 gccttatccg gtaactatcg tcttgagtcc aacccggaaa gacatgcaaa agcaccactg 10380 gcagcagcca ctggtaattg atttagagga gttagtcttg aagtcatgcg ccggttaagg 10440 ctaaactgaa aggacaagtt ttggtgactg cgctcctcca agccagttac ctcggttcaa 10500 agagttggta gctcagagaa ccttcgaaaa accgccctgc aaggcggttt tttcgttttc 10560 agagcaagag attacgcgca gaccaaaacg atctcaagaa gatcatctta ttaatcagat 10620 aaaatatttc tagatttcag tgcaatttat ctcttcaaat gtagcacctg aagtcagccc 10680 catacgatat aagttgtaat tctcatgttt gacagcttat catcgataca aattccccgt 10740 aggcgctagg gacctcttta gcttcttgga agctgtcagt agtatatcta ataatttatc 10800 tccattccct ttagtaacgt gtaactttcc aaatttaaaa aagcgactca tagaattatt 10860 tcctcccgtt aaataataga taactattaa aaatagacaa tacttgctca taagtaatgg 10920 tacttaaatt gtttactttg gcgtgtttca ttgcttgatg aaactgattt ttagtaaaca 10980 gttgacgata ttctcgattg acccattttg aaacaaagta cgtatatagc ttccaatatt 11040 tatctggaac atctgtggta tggcgggtaa gttttattaa gacactgttt acttttggtt 11100 taggatgaaa gcattccgct ggcagcttaa gcaattgctg aatcgagact tgagtgtgca 11160 agagcaaccc tagtgttcgg tgaatatcca aggtacgctt gtagaatcct tcttcaacaa 11220 tcagatagat gtcagacgca cggctttcaa aaaccacttt tttaataatt tgtgtgctta 11280 aatggtaagg aatattccca acaattttat acctctgttt gttagggaat tgaaactgta 11340 gaatatcttg gtgaattaaa gtgacacgag tattcagttt taatttttct gacgataagt 11400 tgaatagatg actgtctaat tcaatagacg ttacctgttt acttatttta gccagtttcg 11460 tcgttaaatg ccctttacct gttccaattt cgtaaacggt atcggtttct tttaaattca 11520 attgttttat tatttggttg agtacttttt cactcgttaa aaagttttga gaatatttta 11580 tatttttgtt catgtaatct ctcctgaagt gattacatct ataaacaaat acagaagtta 11640 aacgatttgt ttgtaatttt agttatctgt ttaaaaagtc ataagattag tcactggtag 11700 gaattaatct aacgtattta tttatctgcg taatcactgt ttttagtctg tttcaaaaca 11760 gtagatgttt tatctacatt acgcatttgg aataccaaca tgacgaatcc ctccttctta 11820 attacaaatt tttagcatct aatttaactt caattcctat tatacaaaat tttaagataa 11880 tgcactatca acacactctt aagtttgctt ctaagtctta tttccataac ttcttttacg 11940 tttccgccat tctttgctgt ttcgattttt atgatatggt gcaagtcagc acgaacacga 12000 accgtcttat ctcccattat atcttttttt gcactgattg gtgtatcatt tcgtttttct 12060 ttttgtgcgc ctaaatttcc cacaatcact cacttctttc tatttcttct tattcttatt 12120 ttatcatcaa caatcacaaa tcacttgtga tttgtgataa gtgatttgtg ataagtgatt 12180 tgtgattaat atataaaagc cctctttaaa gggcttttat gtttattttg agaaagatat 12240 aaaatcaata tatccctttt ccccaatttt tacaacggca ttgtagggct ttttctcttt 12300 gtttttgatt tccttttacc agggtttctt ttccctccag taattctttt acattcgttt 12360 tggttagttt tttctttcta aaatgttcag ctaaagtaaa cttacattca ggataatttg 12420 aacaaccata aaacgatttt tttaatacaa tattgttgcc acacttagga cattttccta 12480 caatactttt ttctgcttct ttttctttct gttcctggta atcagaaaaa tttagttttt 12540 ctatatcgtt aggtacagct tccagtaaat gaacaatgaa ttttttgata ttcgtaataa 12600 agttctcttg attgccttct cttttaccga ttttttttaa atacgtttcc catttagccg 12660 tcatttcagc actcgttaaa aggtgctgac tttcaactgc ctggcacaat aattttcctt 12720 tttcagttac aac 12733 48 9417 DNA Artificial Sequence Vector pEPEF14* 48 acccgatgcc cttgagagcc ttcaacccag tcagctcctt ccggtgggcg cggggcatga 60 ctatcgtcgc cgcacttatg actgtcttct ttatcatgca actcgtagga caggtgccgg 120 cagcgctctg ggtcattttc ggcgaggacc gctttcgctg gagcgcgacg atgatcggcc 180 tgtcgcttgc ggtattcgga atcttgcacg ccctcgctca agccttcgtc actggtcccg 240 ccaccaaacg tttcggcgag aagcaggcca ttatcgccgg catggcggcc gacgcgctgg 300 gctacgtctt gctggcgttc gcgacgcgag gctggatggc cttccccatt atgattcttc 360 tcgcttccgg cggcatcggg atgcccgcgt tgcaggccat gctgtccagg caggtagatg 420 acgaccatca gggacagctt caaggatcgc tcgcggctct taccagccta acttcgatca 480 ctggaccgct gatcgtcacg gcgatttatg ccgcctcggc gagcacatgg aacgggttgg 540 catggattgt aggcgccgcc ctataccttg tctgcctccc cgcgttgcgt cgcggtgcat 600 ggagccgggc cacctcgacc tgaatggaag ccggcggcac ctcgctaacg gattcaccac 660 tccaagaatt ggagccaatc aattcttgcg gagaactgtg aatgcgcaaa ccaacccttg 720 gcagaacata tccatcgcgt ccgccatctc cagcagccgc acgcggcgca tctcgggcag 780 cgttgggtcc tggccacggg tgcgcatgat cgtgctcctg tcgttgagga cccggctagg 840 ctggcggggt tgccttactg gttagcagaa tgaatcaccg atacgcgagc gaacgtgaag 900 cgactgctgc tgcaaaacgt ctgcgacctg agcaacaaca tgaatggtct tcggtttccg 960 tgtttcgtaa agtctggaaa cgcggaagtc ccctacgtgc tgctgaagtt gcccgcaaca 1020 gagagtggaa ccaaccggtg ataccacgat actatgactg agagtcaacg ccatgagcgg 1080 cctcatttct tattctgagt tacaacagtc cgcaccgctg tccggtagct ccttccggtg 1140 ggcgcggggc atgactatcg tcgccgcact tatgactgtc ttctttatca tgcaactcgt 1200 aggacaggtg ccggcagcgc ccaacagtcc cccggccacg gggcctgcca ccatacccac 1260 gccgaaacaa gcgccctgca ccattatgtt ccggatctgc atcgcaggat gctgctggct 1320 accctgtgga acacctacat ctgtattaac gaagcgctaa ccgtttttat caggctctgg 1380 gaggcagaat aaatgatcat atcgtcaatt attacctcca cggggagagc ctgagcaaac 1440 tggcctcagg catttgagaa gcacacggtc acactgcttc cggtagtcaa taaaccggta 1500 aaccagcaat agacataagc ggctatttaa cgaccctgcc ctgaaccgac gaccgggtcg 1560 aatttgcttt cgaatttctg ccattcatcc gcttattatc acttattcag gcgtagcacc 1620 aggcgtttaa gggcaccaat aactgcctta aaaaaattac gccccgccct gccactcatc 1680 gcagtactgt tgtaattcat taagcattct gccgacatgg aagccatcac agacggcatg 1740 atgaacctga atcgccagcg gcatcagcac cttgtcgcct tgcgtataat atttgcccat 1800 ggtgaaaacg ggggcgaaga agttgtccat attggccacg tttaaatcaa aactggtgaa 1860 actcacccag ggattggctg agacgaaaaa catattctca ataaaccctt tagggaaata 1920 ggccaggttt tcaccgtaac acgccacatc ttgcgaatat atgtgtagaa actgccggaa 1980 atcgtcgtgg tattcactcc agagcgatga aaacgtttca gtttgctcat ggaaaacggt 2040 gtaacaaggg tgaacactat cccatatcac cagctcaccg tctttcattg ccatacgaga 2100 ggtttgatga ctttgacctt tggttttaaa tttttgaaaa aaataaaaaa aggcgaagcc 2160 tattatatat ttatcatata tattttaatc ttttgttctt ttgcgtgaaa aaaaaggcag 2220 tgttttcgct agttatagaa attaaacagt cacaaaaatc gatgtataca gtcacaaaaa 2280 tcgatttttg tgactctatg catcgatgta tacgagtcac aaaaatcgat gtacacagta 2340 cgacttttgt atttgtgtac tgtatatagt ataataaaag catagagaaa actcactatg 2400 aaatgacttt ctctatgcta ctactaaaac acgcaaagga gcgtatttat actatgatta 2460 taccagaaaa acaaaacaaa caaaagcagg tgctgacctt gaatgaatta gaaaaacgca 2520 aagtagtgga gcataacgct ttaattcaaa gtgtggctaa aatgcaaaaa acagctttaa 2580 aaatgtttga gttagccgtg tcttgtattg ataccgaaga acccccaaaa aataatactg 2640 tttatctttt aaaatctgaa ctttttaaat tttttgaagt atcgtctagc agtaagcata 2700 gtcaattcaa agaagcagtt aattatatgc aaaagcaagc ctttttcaat attaaagctg 2760 ataagaaact aggtattgag tatgaaagca ttgtgcctat tccttatgta aaatggaacg 2820 attataacga tgaagtaaca atacgttttg accaagccat tatgccctat ctgattgatt 2880 taaaggcaga atttactcaa tataaaattt ccgaactaca aaaactcaat agcaagtatt 2940 cgattatttt gtatcgttgg ttatccatga attataacca atacgaacat tacagcgtga 3000 aagggggacg gagagttgaa caagtggaat cctaccgcaa cccctcaata aaagtgaaag 3060 aattgcgaga aataactgat acaataaatg aacatcaaca ttttccccat tttgaaacta 3120 gagtattaaa aaaagcaatt gaagaaatca acgctcacac ctcttttaat gtaacctatg 3180 agaaaaagaa agcagggcgg agcattgact ctattgtctt tcatattgag aagaaacgca 3240 tggcagacga taacagttac aagttgggag ataaagacta tcaagacgat aaaaagcaaa 3300 aatcaagaaa tgaagctgac ttactaaaac aggcaatgga aagtaaatat acaagacttt 3360 tatctgaaaa tttcttaatt ggcatgaatg atattatgga tacagccaca atggtaggct 3420 tacagaagaa tgtttatccg ctttatgacg aactgaaaga attaagaggg ctaaatggtg 3480 tcaaagacca cttgtcttat gttgccagca aaaaagaagc ctattctaaa cgtaatgtag 3540 cgaaatatct taaaaaagcc attgaacaat atctgccaac ggttaaaagg caggacttaa 3600 accatgagtg aagacttaaa aacgataaaa gagttggcgg acgagctagg actagctaaa 3660 cagaccgttc aataccattt aaagtttttg ccgaccgaaa atcgacaaaa aaacaatagt 3720 ggttcgatcg tcctaaacct tgaagaacaa aagtttatca aaaaaagact aacaaaaata 3780 aaaaaagttc aaccgaccga aaatcgacaa gaaaaaatga agaattaagc gttaatagtc 3840 ttttgttagg cggatgagca ttcatcaggc gggcaagaat gtgaataaag gccggataaa 3900 acttgtgctt atttttcttt acggtcttta aaaaggccgt aatatccagc tgaacggtct 3960 ggttataggt acattgagca actgactgaa atgcctcaaa atgttcttta cgatgccatt 4020 gggatatatc aacggtggta tatccagtga tttttttctc cattttagct tccttagctc 4080 ctgaaaatct cgataactca aaaaatacgc ccggtagtga tcttatttca ttatggtgaa 4140 agttggaacc tcttacgtgc cgatcaacgt ctcattttcg ccaaaagttg gcccagggct 4200 tcccggtatc aacagggaca ccaggattta tttattctgc gaagtgatct tccgtcacag 4260 gtatttattc ggcgcaaagt gcgtcgggtg atgctgccaa cttactgatt tagtgtatga 4320 tggtgttttt gaggtgctcc agtggcttct gtttctatca gctgtccctc ctgttcagct 4380 actgacgggg tggtgcgtaa cggcaaaagc accgccggac atcagcgcta gcggagtgta 4440 tactggctta ctatgttggc actgatgagg gtgtcagtga agtgcttcat gtggcaggag 4500 aaaaaaggct gcaccggtgc gtcagcagaa tatgtgatac aggatatatt ccgcttcctc 4560 gctcactgac tcgctacgct cggtcgttcg actgcggcga gcggaaatgg cttacgaacg 4620 gggcggagat ttcctggaag atgccaggaa gatacttaac agggaagtga gagggccgcg 4680 gcaaagccgt ttttccatag gctccgcccc cctgacaagc atcacgaaat ctgacgctca 4740 aatcagtggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc ccctggcggc 4800 tccctcgtgc gctctcctgt tcctgccttt cggtttaccg gtgtcattcc gctgttatgg 4860 ccgcgtttgt ctcattccac gcctgacact cagttccggg taggcagttc gctccaagct 4920 ggactgtatg cacgaacccc ccgttcagtc cgaccgctgc gccttatccg gtaactatcg 4980 tcttgagtcc aacccggaaa gacatgcaaa agcaccactg gcagcagcca ctggtaattg 5040 atttagagga gttagtcttg aagtcatgcg ccggttaagg ctaaactgaa aggacaagtt 5100 ttggtgactg cgctcctcca agccagttac ctcggttcaa agagttggta gctcagagaa 5160 ccttcgaaaa accgccctgc aaggcggttt tttcgttttc agagcaagag attacgcgca 5220 gaccaaaacg atctcaagaa gatcatctta ttaatcagat aaaatatttc tagatttcag 5280 tgcaatttat ctcttcaaat gtagcacctg aagtcagccc catacgatat aagttgtaat 5340 tctcatgttt gacagcttat catcgataca aattccccgt aggcgctagg gacctcttta 5400 gcttcttgga agctgtcagt agtatatcta ataatttatc tccattccct ttagtaacgt 5460 gtaactttcc aaatttaaaa aagcgactca tagaattatt tcctcccgtt aaataataga 5520 taactattaa aaatagacaa tacttgctca taagtaatgg tacttaaatt gtttactttg 5580 gcgtgtttca ttgcttgatg aaactgattt ttagtaaaca gttgacgata ttctcgattg 5640 acccattttg aaacaaagta cgtatatagc ttccaatatt tatctggaac atctgtggta 5700 tggcgggtaa gttttattaa gacactgttt acttttggtt taggatgaaa gcattccgct 5760 ggcagcttaa gcaattgctg aatcgagact tgagtgtgca agagcaaccc tagtgttcgg 5820 tgaatatcca aggtacgctt gtagaatcct tcttcaacaa tcagatagat gtcagacgca 5880 cggctttcaa aaaccacttt tttaataatt tgtgtgctta aatggtaagg aatattccca 5940 acaattttat acctctgttt gttagggaat tgaaactgta gaatatcttg gtgaattaaa 6000 gtgacacgag tattcagttt taatttttct gacgataagt tgaatagatg actgtctaat 6060 tcaatagacg ttacctgttt acttatttta gccagtttcg tcgttaaatg ccctttacct 6120 gttccaattt cgtaaacggt atcggtttct tttaaattca attgttttat tatttggttg 6180 agtacttttt cactcgttaa aaagttttga gaatatttta tatttttgtt catgtaatct 6240 ctcctgaagt gattacatct ataaacaaat acagaagtta aacgatttgt ttgtaatttt 6300 agttatctgt ttaaaaagtc ataagattag tcactggtag gaattaatct aacgtattta 6360 tttatctgcg taatcactgt ttttagtctg tttcaaaaca gtagatgttt tatctacatt 6420 acgcatttgg aataccaaca tgacgaatcc ctccttctta attacaaatt tttagcatct 6480 aatttaactt caattcctat tatacaaaat tttaagataa tgcactatca acacactctt 6540 aagtttgctt ctaagtctta tttccataac ttcttttacg tttccgccat tctttgctgt 6600 ttcgattttt atgatatggt gcaagtcagc acgaacacga accgtcttat ctcccattat 6660 atcttttttt gcactgattg gtgtatcatt tcgtttttct ttttgtgcgc ctaaatttcc 6720 cacaatcact cacttctttc tatttcttct tattcttatt ttatcatcaa caatcacaaa 6780 tcacttgtga tttgtgataa gtgatttgtg ataagtgatt tgtgattaat atataaaagc 6840 cctctttaaa gggcttttat gtttattttg agaaagatat aaaatcaata tatccctttt 6900 ccccaatttt tacaacggca ttgtagggct ttttctcttt gtttttgatt tccttttacc 6960 agggtttctt ttccctccag taattctttt acattcgttt tggttagttt tttctttcta 7020 aaatgttcag ctaaagtaaa cttacattca ggataatttg aacaaccata aaacgatttt 7080 tttaatacaa tattgttgcc acacttagga cattttccta caatactttt ttctgcttct 7140 ttttctttct gttcctggta atcagaaaaa tttagttttt ctatatcgtt aggtacagct 7200 tccagtaaat gaacaatgaa ttttttgata ttcgtaataa agttctcttg attgccttct 7260 cttttaccga ttttttttaa atacgtttcc catttagccg tcatttcagc actcgttaaa 7320 aggtgctgac tttcaactgc ctggcacaat aattttcctt tttcagttac aacagcctgc 7380 aggtcgacga taaccacata acagtcataa aaccactcct ttttaacaaa ctttatcaca 7440 agaaatattt tggcattcta cgactataac ttaaatttat attttttact ttataatata 7500 taattgatta tagaataatg ttgctcatat cgtttgccaa catctagtac tcaaattaca 7560 ctatgttaca cttggtaata ttaaccgaac ttcccctgtc caaattagat aagaggtaat 7620 aataaatgga aaataatttt atagtaaatg aaaatgagaa gcgtgtatta aaacaaattt 7680 tcaataacag caatatttca cgaacacaaa tatcgaagaa tttagaactt aataaagcta 7740 ctatttctaa cattctgaac aacttaaaac acaagagttt agttaatgaa gtaggagaag 7800 gtaatagtac taaaagtggt ggacgaaagc ctattttact cgaaattaac caaaaatatg 7860 gctactatat ttctatggat ttaacatatg attccgttga attaatgtac aactactttg 7920 atgctactat attaaagcaa gattcctacg aattaaatga taaaaatgta agcagtatat 7980 tacaaatttt aaaatctaat ataaacgtct cagaaaaata tgatacgtta tatgggttac 8040 ttggtatatc tatatccata cacggtatcg ttgacgatga gcaaaacata atcaatcttc 8100 cttttcataa aaatgagaaa cgcacattta ccgatgaatt aaagtcattc acaaatgttc 8160 ctgtcgttat agaaaatgaa gcaaatttat cagcgctata tgaaaaaagt ttatatatta 8220 attcaaacat aaataatttg attactttaa gtattcacaa gggtataggc gctggcatca 8280 taataaataa aaaactttat cgtggctcaa atggagaggc tggagagata ggtaagacat 8340 tggttttgga atctataaat aacaatgaca acaaatatta taaaatcgaa gatatatgct 8400 cccaagacgc tttaatacag aaaataaata ataggttggg cgtcacattg acgtttacag 8460 aactaatcca atattacaac gaaggaaatt caattgttgc tcatgaaatt aaacaattta 8520 ttaataaaat gacagttctg attcataatt tgaatacaca atttaaccca gacgctattt 8580 atattaactg tcctttaatt aatgaattac caaatatttt aaatgaaatt aaagagcaat 8640 tctcctgttt ttctcaaggc agtccaattc aattacattt aactactaat gtaaaacaag 8700 ctactttatt gggtggcact ttagcaataa tgcaaaaaac attaaatata aataacattc 8760 aaatgaatat taaataatta cagcagtctg agttataaaa tagatatctc ggaccgaaaa 8820 atgacagttt attcttgaca gggagagata ggtttgatat aatataatag ttgtctcgag 8880 agttagtttg tttattaaat taaccaactg gacgaattcc cggggatccg tcgacctgca 8940 gccaagcttg ggcttttcag cctgatacag attaaatcag aacgcagaag cggtctgata 9000 aaacagaatt tgcctggcgg cagtagcgcg gtggtcccac ctgaccccat gccgaactca 9060 gaagtgaaac gccgtagcgc cgatggtagt gtggggtctc cccatgcgag agtagggaac 9120 tgccaggcat caaataaaac gaaaggctca gtcgaaagac tgggcctttc gttttatctg 9180 ttgtttgtcg gtgaacgctc tcctgagtag gacaaatccg ccgggagcgg atttgaacgt 9240 tgcgaagcaa cggcccggag ggtggcgggc aggacgcccg ccataaactg ccaggcatca 9300 aattaagcag aaggccatcc tgacggatgg cctttttgcg tttctacaaa ctctttttgt 9360 ttatttttct aaatacattc aaatatgtat ccgctgatcc ccatcctatc gatcccc 9417 49 9484 DNA Artificial Sequence Vector P1P2/EF14 49 acccgatgcc cttgagagcc ttcaacccag tcagctcctt ccggtgggcg cggggcatga 60 ctatcgtcgc cgcacttatg actgtcttct ttatcatgca actcgtagga caggtgccgg 120 cagcgctctg ggtcattttc ggcgaggacc gctttcgctg gagcgcgacg atgatcggcc 180 tgtcgcttgc ggtattcgga atcttgcacg ccctcgctca agccttcgtc actggtcccg 240 ccaccaaacg tttcggcgag aagcaggcca ttatcgccgg catggcggcc gacgcgctgg 300 gctacgtctt gctggcgttc gcgacgcgag gctggatggc cttccccatt atgattcttc 360 tcgcttccgg cggcatcggg atgcccgcgt tgcaggccat gctgtccagg caggtagatg 420 acgaccatca gggacagctt caaggatcgc tcgcggctct taccagccta acttcgatca 480 ctggaccgct gatcgtcacg gcgatttatg ccgcctcggc gagcacatgg aacgggttgg 540 catggattgt aggcgccgcc ctataccttg tctgcctccc cgcgttgcgt cgcggtgcat 600 ggagccgggc cacctcgacc tgaatggaag ccggcggcac ctcgctaacg gattcaccac 660 tccaagaatt ggagccaatc aattcttgcg gagaactgtg aatgcgcaaa ccaacccttg 720 gcagaacata tccatcgcgt ccgccatctc cagcagccgc acgcggcgca tctcgggcag 780 cgttgggtcc tggccacggg tgcgcatgat cgtgctcctg tcgttgagga cccggctagg 840 ctggcggggt tgccttactg gttagcagaa tgaatcaccg atacgcgagc gaacgtgaag 900 cgactgctgc tgcaaaacgt ctgcgacctg agcaacaaca tgaatggtct tcggtttccg 960 tgtttcgtaa agtctggaaa cgcggaagtc ccctacgtgc tgctgaagtt gcccgcaaca 1020 gagagtggaa ccaaccggtg ataccacgat actatgactg agagtcaacg ccatgagcgg 1080 cctcatttct tattctgagt tacaacagtc cgcaccgctg tccggtagct ccttccggtg 1140 ggcgcggggc atgactatcg tcgccgcact tatgactgtc ttctttatca tgcaactcgt 1200 aggacaggtg ccggcagcgc ccaacagtcc cccggccacg gggcctgcca ccatacccac 1260 gccgaaacaa gcgccctgca ccattatgtt ccggatctgc atcgcaggat gctgctggct 1320 accctgtgga acacctacat ctgtattaac gaagcgctaa ccgtttttat caggctctgg 1380 gaggcagaat aaatgatcat atcgtcaatt attacctcca cggggagagc ctgagcaaac 1440 tggcctcagg catttgagaa gcacacggtc acactgcttc cggtagtcaa taaaccggta 1500 aaccagcaat agacataagc ggctatttaa cgaccctgcc ctgaaccgac gaccgggtcg 1560 aatttgcttt cgaatttctg ccattcatcc gcttattatc acttattcag gcgtagcacc 1620 aggcgtttaa gggcaccaat aactgcctta aaaaaattac gccccgccct gccactcatc 1680 gcagtactgt tgtaattcat taagcattct gccgacatgg aagccatcac agacggcatg 1740 atgaacctga atcgccagcg gcatcagcac cttgtcgcct tgcgtataat atttgcccat 1800 ggtgaaaacg ggggcgaaga agttgtccat attggccacg tttaaatcaa aactggtgaa 1860 actcacccag ggattggctg agacgaaaaa catattctca ataaaccctt tagggaaata 1920 ggccaggttt tcaccgtaac acgccacatc ttgcgaatat atgtgtagaa actgccggaa 1980 atcgtcgtgg tattcactcc agagcgatga aaacgtttca gtttgctcat ggaaaacggt 2040 gtaacaaggg tgaacactat cccatatcac cagctcaccg tctttcattg ccatacgaat 2100 tcagaggttt gatgactttg acctttggtt ttaaattttt gaaaaaaata aaaaaaggcg 2160 aagcctatta tatatttatc atatatattt taatcttttg ttcttttgcg tgaaaaaaaa 2220 ggcagtgttt tcgctagtta tagaaattaa acagtcacaa aaatcgatgt atacagtcac 2280 aaaaatcgat ttttgtgact ctatgcatcg atgtatacga gtcacaaaaa tcgatgtaca 2340 cagtacgact tttgtatttg tgtactgtat atagtataat aaaagcatag agaaaactca 2400 ctatgaaatg actttctcta tgctactact aaaacacgca aaggagcgta tttatactat 2460 gattatacca gaaaaacaaa acaaacaaaa gcaggtgctg accttgaatg aattagaaaa 2520 acgcaaagta gtggagcata acgctttaat tcaaagtgtg gctaaaatgc aaaaaacagc 2580 tttaaaaatg tttgagttag ccgtgtcttg tattgatacc gaagaacccc caaaaaataa 2640 tactgtttat cttttaaaat ctgaactttt taaatttttt gaagtatcgt ctagcagtaa 2700 gcatagtcaa ttcaaagaag cagttaatta tatgcaaaag caagcctttt tcaatattaa 2760 agctgataag aaactaggta ttgagtatga aagcattgtg cctattcctt atgtaaaatg 2820 gaacgattat aacgatgaag taacaatacg ttttgaccaa gccattatgc cctatctgat 2880 tgatttaaag gcagaattta ctcaatataa aatttccgaa ctacaaaaac tcaatagcaa 2940 gtattcgatt attttgtatc gttggttatc catgaattat aaccaatacg aacattacag 3000 cgtgaaaggg ggacggagag ttgaacaagt ggaatcctac cgcaacccct caataaaagt 3060 gaaagaattg cgagaaataa ctgatacaat aaatgaacat caacattttc cccattttga 3120 aactagagta ttaaaaaaag caattgaaga aatcaacgct cacacctctt ttaatgtaac 3180 ctatgagaaa aagaaagcag ggcggagcat tgactctatt gtctttcata ttgagaagaa 3240 acgcatggca gacgataaca gttacaagtt gggagataaa gactatcaag acgataaaaa 3300 gcaaaaatca agaaatgaag ctgacttact aaaacaggca atggaaagta aatatacaag 3360 acttttatct gaaaatttct taattggcat gaatgatatt atggatacag ccacaatggt 3420 aggcttacag aagaatgttt atccgcttta tgacgaactg aaagaattaa gagggctaaa 3480 tggtgtcaaa gaccacttgt cttatgttgc cagcaaaaaa gaagcctatt ctaaacgtaa 3540 tgtagcgaaa tatcttaaaa aagccattga acaatatctg ccaacggtta aaaggcagga 3600 cttaaaccat gagtgaagac ttaaaaacga taaaagagtt ggcggacgag ctaggactag 3660 ctaaacagac cgttcaatac catttaaagt ttttgccgac cgaaaatcga caaaaaaaca 3720 atagtggttc gatcgtccta aaccttgaag aacaaaagtt tatcaaaaaa agactaacaa 3780 aaataaaaaa agttcaaccg accgaaaatc gacaagaaaa aatgaagaat taagcgttaa 3840 tagtcttttg ttaggaattc cggatgagca ttcatcaggc gggcaagaat gtgaataaag 3900 gccggataaa acttgtgctt atttttcttt acggtcttta aaaaggccgt aatatccagc 3960 tgaacggtct ggttataggt acattgagca actgactgaa atgcctcaaa atgttcttta 4020 cgatgccatt gggatatatc aacggtggta tatccagtga tttttttctc cattttagct 4080 tccttagctc ctgaaaatct cgataactca aaaaatacgc ccggtagtga tcttatttca 4140 ttatggtgaa agttggaacc tcttacgtgc cgatcaacgt ctcattttcg ccaaaagttg 4200 gcccagggct tcccggtatc aacagggaca ccaggattta tttattctgc gaagtgatct 4260 tccgtcacag gtatttattc ggcgcaaagt gcgtcgggtg atgctgccaa cttactgatt 4320 tagtgtatga tggtgttttt gaggtgctcc agtggcttct gtttctatca gctgtccctc 4380 ctgttcagct actgacgggg tggtgcgtaa cggcaaaagc accgccggac atcagcgcta 4440 gcggagtgta tactggctta ctatgttggc actgatgagg gtgtcagtga agtgcttcat 4500 gtggcaggag aaaaaaggct gcaccggtgc gtcagcagaa tatgtgatac aggatatatt 4560 ccgcttcctc gctcactgac tcgctacgct cggtcgttcg actgcggcga gcggaaatgg 4620 cttacgaacg gggcggagat ttcctggaag atgccaggaa gatacttaac agggaagtga 4680 gagggccgcg gcaaagccgt ttttccatag gctccgcccc cctgacaagc atcacgaaat 4740 ctgacgctca aatcagtggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc 4800 ccctggcggc tccctcgtgc gctctcctgt tcctgccttt cggtttaccg gtgtcattcc 4860 gctgttatgg ccgcgtttgt ctcattccac gcctgacact cagttccggg taggcagttc 4920 gctccaagct ggactgtatg cacgaacccc ccgttcagtc cgaccgctgc gccttatccg 4980 gtaactatcg tcttgagtcc aacccggaaa gacatgcaaa agcaccactg gcagcagcca 5040 ctggtaattg atttagagga gttagtcttg aagtcatgcg ccggttaagg ctaaactgaa 5100 aggacaagtt ttggtgactg cgctcctcca agccagttac ctcggttcaa agagttggta 5160 gctcagagaa ccttcgaaaa accgccctgc aaggcggttt tttcgttttc agagcaagag 5220 attacgcgca gaccaaaacg atctcaagaa gatcatctta ttaatcagat aaaatatttc 5280 tagatttcag tgcaatttat ctcttcaaat gtagcacctg aagtcagccc catacgatat 5340 aagttgtaat tctcatgttt gacagcttat catcgataca aattccccgt aggcgctagg 5400 gacctcttta gcttcttgga agctgtcagt agtatatcta ataatttatc tccattccct 5460 ttagtaacgt gtaactttcc aaatttaaaa aagcgactca tagaattatt tcctcccgtt 5520 aaataataga taactattaa aaatagacaa tacttgctca taagtaatgg tacttaaatt 5580 gtttactttg gcgtgtttca ttgcttgatg aaactgattt ttagtaaaca gttgacgata 5640 ttctcgattg acccattttg aaacaaagta cgtatatagc ttccaatatt tatctggaac 5700 atctgtggta tggcgggtaa gttttattaa gacactgttt acttttggtt taggatgaaa 5760 gcattccgct ggcagcttaa gcaattgctg aatcgagact tgagtgtgca agagcaaccc 5820 tagtgttcgg tgaatatcca aggtacgctt gtagaatcct tcttcaacaa tcagatagat 5880 gtcagacgca cggctttcaa aaaccacttt tttaataatt tgtgtgctta aatggtaagg 5940 aatattccca acaattttat acctctgttt gttagggaat tgaaactgta gaatatcttg 6000 gtgaattaaa gtgacacgag tattcagttt taatttttct gacgataagt tgaatagatg 6060 actgtctaat tcaatagacg ttacctgttt acttatttta gccagtttcg tcgttaaatg 6120 ccctttacct gttccaattt cgtaaacggt atcggtttct tttaaattca attgttttat 6180 tatttggttg agtacttttt cactcgttaa aaagttttga gaatatttta tatttttgtt 6240 catgtaatct ctcctgaagt gattacatct ataaacaaat acagaagtta aacgatttgt 6300 ttgtaatttt agttatctgt ttaaaaagtc ataagattag tcactggtag gaattaatct 6360 aacgtattta tttatctgcg taatcactgt ttttagtctg tttcaaaaca gtagatgttt 6420 tatctacatt acgcatttgg aataccaaca tgacgaatcc ctccttctta attacaaatt 6480 tttagcatct aatttaactt caattcctat tatacaaaat tttaagataa tgcactatca 6540 acacactctt aagtttgctt ctaagtctta tttccataac ttcttttacg tttccgccat 6600 tctttgctgt ttcgattttt atgatatggt gcaagtcagc acgaacacga accgtcttat 6660 ctcccattat atcttttttt gcactgattg gtgtatcatt tcgtttttct ttttgtgcgc 6720 ctaaatttcc cacaatcact cacttctttc tatttcttct tattcttatt ttatcatcaa 6780 caatcacaaa tcacttgtga tttgtgataa gtgatttgtg ataagtgatt tgtgattaat 6840 atataaaagc cctctttaaa gggcttttat gtttattttg agaaagatat aaaatcaata 6900 tatccctttt ccccaatttt tacaacggca ttgtagggct ttttctcttt gtttttgatt 6960 tccttttacc agggtttctt ttccctccag taattctttt acattcgttt tggttagttt 7020 tttctttcta aaatgttcag ctaaagtaaa cttacattca ggataatttg aacaaccata 7080 aaacgatttt tttaatacaa tattgttgcc acacttagga cattttccta caatactttt 7140 ttctgcttct ttttctttct gttcctggta atcagaaaaa tttagttttt ctatatcgtt 7200 aggtacagct tccagtaaat gaacaatgaa ttttttgata ttcgtaataa agttctcttg 7260 attgccttct cttttaccga ttttttttaa atacgtttcc catttagccg tcatttcagc 7320 actcgttaaa aggtgctgac tttcaactgc ctggcacaat aattttcctt tttcagttac 7380 aacagcctgc aggtcgacga taaccacata acagtcataa aaccactcct ttttaacaaa 7440 ctttatcaca agaaatattt tggcattcta cgactataac ttaaatttat attttttact 7500 ttataatata taattgatta tagaataatg ttgctcatat cgtttgccaa catctagtac 7560 tcaaattaca ctatgttaca cttggtaata ttaaccgaac ttcccctgtc caaattagat 7620 aagaggtaat aataaatgga aaataatttt atagtaaatg aaaatgagaa gcgtgtatta 7680 aaacaaattt tcaataacag caatatttca cgaacacaaa tatcgaagaa tttagaactt 7740 aataaagcta ctatttctaa cattctgaac aacttaaaac acaagagttt agttaatgaa 7800 gtaggagaag gtaatagtac taaaagtggt ggacgaaagc ctattttact cgaaattaac 7860 caaaaatatg gctactatat ttctatggat ttaacatatg attccgttga attaatgtac 7920 aactactttg atgctactat attaaagcaa gattcctacg aattaaatga taaaaatgta 7980 agcagtatat tacaaatttt aaaatctaat ataaacgtct cagaaaaata tgatacgtta 8040 tatgggttac ttggtatatc tatatccata cacggtatcg ttgacgatga gcaaaacata 8100 atcaatcttc cttttcataa aaatgagaaa cgcacattta ccgatgaatt aaagtcattc 8160 acaaatgttc ctgtcgttat agaaaatgaa gcaaatttat cagcgctata tgaaaaaagt 8220 ttatatatta attcaaacat aaataatttg attactttaa gtattcacaa gggtataggc 8280 gctggcatca taataaataa aaaactttat cgtggctcaa atggagaggc tggagagata 8340 ggtaagacat tggttttgga atctataaat aacaatgaca acaaatatta taaaatcgaa 8400 gatatatgct cccaagacgc tttaatacag aaaataaata ataggttggg cgtcacattg 8460 acgtttacag aactaatcca atattacaac gaaggaaatt caattgttgc tcatgaaatt 8520 aaacaattta ttaataaaat gacagttctg attcataatt tgaatacaca atttaaccca 8580 gacgctattt atattaactg tcctttaatt aatgaattac caaatatttt aaatgaaatt 8640 aaagagcaat tctcctgttt ttctcaaggc agtccaattc aattacattt aactactaat 8700 gtaaaacaag ctactttatt gggtggcact ttagcaataa tgcaaaaaac attaaatata 8760 aataacattc aaatgaatat taaataatta cagcagtctg agttataaaa tagatatctc 8820 ggaccggtct agttattctt gacaaaaaat aatattttga tataattaaa tagttgtcgt 8880 ttgataacac agtttattct tgacaaaaaa atataaaagt gtataataga aaagtactgt 8940 tctcgagagt tagtttgttt attaaattaa ccaactggac gaattcccgg ggatccgtcg 9000 acctgcagcc aagcttgggc ttttcagcct gatacagatt aaatcagaac gcagaagcgg 9060 tctgataaaa cagaatttgc ctggcggcag tagcgcggtg gtcccacctg accccatgcc 9120 gaactcagaa gtgaaacgcc gtagcgccga tggtagtgtg gggtctcccc atgcgagagt 9180 agggaactgc caggcatcaa ataaaacgaa aggctcagtc gaaagactgg gcctttcgtt 9240 ttatctgttg tttgtcggtg aacgctctcc tgagtaggac aaatccgccg ggagcggatt 9300 tgaacgttgc gaagcaacgg cccggagggt ggcgggcagg acgcccgcca taaactgcca 9360 ggcatcaaat taagcagaag gccatcctga cggatggcct ttttgcgttt ctacaaactc 9420 tttttgttta tttttctaaa tacattcaaa tatgtatccg ctgatcccca tcctatcgat 9480 cccc 9484 50 9566 DNA Artificial Sequence Vector pEPEF18 50 acccagtcag ctccttccgg tgggcgcggg gcatgactat cgtcgccgca cttatgactg 60 tcttctttat catgcaactc gtaggacagg tgccggcagc gctctgggtc attttcggcg 120 aggaccgctt tcgctggagc gcgacgatga tcggcctgtc gcttgcggta ttcggaatct 180 tgcacgccct cgctcaagcc ttcgtcactg gtcccgccac caaacgtttc ggcgagaagc 240 aggccattat cgccggcatg gcggccgacg cgctgggcta cgtcttgctg gcgttcgcga 300 cgcgaggctg gatggccttc cccattatga ttcttctcgc ttccggcggc atcgggatgc 360 ccgcgttgca ggccatgctg tccaggcagg tagatgacga ccatcaggga cagcttcaag 420 gatcgctcgc ggctcttacc agcctaactt cgatcactgg accgctgatc gtcacggcga 480 tttatgccgc ctcggcgagc acatggaacg ggttggcatg gattgtaggc gccgccctat 540 accttgtctg cctccccgcg ttgcgtcgcg gtgcatggag ccgggccacc tcgacctgaa 600 tggaagccgg cggcacctcg ctaacggatt caccactcca agaattggag ccaatcaatt 660 cttgcggaga actgtgaatg cgcaaaccaa cccttggcag aacatatcca tcgcgtccgc 720 catctccagc agccgcacgc ggcgcatctc gggcagcgtt gggtcctggc cacgggtgcg 780 catgatcgtg ctcctgtcgt tgaggacccg gctaggctgg cggggttgcc ttactggtta 840 gcagaatgaa tcaccgatac gcgagcgaac gtgaagcgac tgctgctgca aaacgtctgc 900 gacctgagca acaacatgaa tggtcttcgg tttccgtgtt tcgtaaagtc tggaaacgcg 960 gaagtcccct acgtgctgct gaagttgccc gcaacagaga gtggaaccaa ccggtgatac 1020 cacgatacta tgactgagag tcaacgccat gagcggcctc atttcttatt ctgagttaca 1080 acagtccgca ccgctgtccg gtagctcctt ccggtgggcg cggggcatga ctatcgtcgc 1140 cgcacttatg actgtcttct ttatcatgca actcgtagga caggtgccgg cagcgcccaa 1200 cagtcccccg gccacggggc ctgccaccat acccacgccg aaacaagcgc cctgcaccat 1260 tatgttccgg atctgcatcg caggatgctg ctggctaccc tgtggaacac ctacatctgt 1320 attaacgaag cgctaaccgt ttttatcagg ctctgggagg cagaataaat gatcatatcg 1380 tcaattatta cctccacggg gagagcctga gcaaactggc ctcaggcatt tgagaagcac 1440 acggtcacac tgcttccggt agtcaataaa ccggtaaacc agcaatagac ataagcggct 1500 atttaacgac cctgccctga accgacgacc gggtcgaatt tgctttcgaa tttctgccat 1560 tcatccgctt attatcactt attcaggcgt agcaccaggc gtttaagggc accaataact 1620 gccttaaaaa aattacgccc cgccctgcca ctcatcgcag tactgttgta attcattaag 1680 cattctgccg acatggaagc catcacagac ggcatgatga acctgaatcg ccagcggcat 1740 cagcaccttg tcgccttgcg tataatattt gcccatggtg aaaacggggg cgaagaagtt 1800 gtccatattg gccacgttta aatcaaaact ggtgaaactc acccagggat tggctgagac 1860 gaaaaacata ttctcaataa accctttagg gaaataggcc aggttttcac cgtaacacgc 1920 cacatcttgc gaatatatgt gtagaaactg ccggaaatcg tcgtggtatt cactccagag 1980 cgatgaaaac gtttcagttt gctcatggaa aacggtgtaa caagggtgaa cactatccca 2040 tatcaccagc tcaccgtctt tcattgccat acgagaggtt tgatgacttt gacctttggt 2100 tttaaatttt tgaaaaaaat aaaaaaaggc gaagcctatt atatatttat catatatatt 2160 ttaatctttt gttcttttgc gtgaaaaaaa aggcagtgtt ttcgctagtt atagaaatta 2220 aacagtcaca aaaatcgatg tatacagtca caaaaatcga tttttgtgac tctatgcatc 2280 gatgtatacg agtcacaaaa atcgatgtac acagtacgac ttttgtattt gtgtactgta 2340 tatagtataa taaaagcata gagaaaactc actatgaaat gactttctct atgctactac 2400 taaaacacgc aaaggagcgt atttatacta tgattatacc agaaaaacaa aacaaacaaa 2460 agcaggtgct gaccttgaat gaattagaaa aacgcaaagt agtggagcat aacgctttaa 2520 ttcaaagtgt ggctaaaatg caaaaaacag ctttaaaaat gtttgagtta gccgtgtctt 2580 gtattgatac cgaagaaccc ccaaaaaata atactgttta tcttttaaaa tctgaacttt 2640 ttaaattttt tgaagtatcg tctagcagta agcatagtca attcaaagaa gcagttaatt 2700 atatgcaaaa gcaagccttt ttcaatatta aagctgataa gaaactaggt attgagtatg 2760 aaagcattgt gcctattcct tatgtaaaat ggaacgatta taacgatgaa gtaacaatac 2820 gttttgacca agccattatg ccctatctga ttgatttaaa ggcagaattt actcaatata 2880 aaatttccga actacaaaaa ctcaatagca agtattcgat tattttgtat cgttggttat 2940 ccatgaatta taaccaatac gaacattaca gcgtgaaagg gggacggaga gttgaacaag 3000 tggaatccta ccgcaacccc tcaataaaag tgaaagaatt gcgagaaata actgatacaa 3060 taaatgaaca tcaacatttt ccccattttg aaactagagt attaaaaaaa gcaattgaag 3120 aaatcaacgc tcacacctct tttaatgtaa cctatgagaa aaagaaagca gggcggagca 3180 ttgactctat tgtctttcat attgagaaga aacgcatggc agacgataac agttacaagt 3240 tgggagataa agactatcaa gacgataaaa agcaaaaatc aagaaatgaa gctgacttac 3300 taaaacaggc aatggaaagt aaatatacaa gacttttatc tgaaaatttc ttaattggca 3360 tgaatgatat tatggataca gccacaatgg taggcttaca gaagaatgtt tatccgcttt 3420 atgacgaact gaaagaatta agagggctaa atggtgtcaa agaccacttg tcttatgttg 3480 ccagcaaaaa agaagcctat tctaaacgta atgtagcgaa atatcttaaa aaagccattg 3540 aacaatatct gccaacggtt aaaaggcagg acttaaacca tgagtgaaga cttaaaaacg 3600 ataaaagagt tggcggacga gctaggacta gctaaacaga ccgttcaata ccatttaaag 3660 tttttgccga ccgaaaatcg acaaaaaaac aatagtggtt cgatcgtcct aaaccttgaa 3720 gaacaaaagt ttatcaaaaa aagactaaca aaaataaaaa aagttcaacc gaccgaaaat 3780 cgacaagaaa aaatgaagaa ttaagcgtta atagtctttt gttaggcgga tgagcattca 3840 tcaggcgggc aagaatgtga ataaaggccg gataaaactt gtgcttattt ttctttacgg 3900 tctttaaaaa ggccgtaata tccagctgaa cggtctggtt ataggtacat tgagcaactg 3960 actgaaatgc ctcaaaatgt tctttacgat gccattggga tatatcaacg gtggtatatc 4020 cagtgatttt tttctccatt ttagcttcct tagctcctga aaatctcgat aactcaaaaa 4080 atacgcccgg tagtgatctt atttcattat ggtgaaagtt ggaacctctt acgtgccgat 4140 caacgtctca ttttcgccaa aagttggccc agggcttccc ggtatcaaca gggacaccag 4200 gatttattta ttctgcgaag tgatcttccg tcacaggtat ttattcggcg caaagtgcgt 4260 cgggtgatgc tgccaactta ctgatttagt gtatgatggt gtttttgagg tgctccagtg 4320 gcttctgttt ctatcagctg tccctcctgt tcagctactg acggggtggt gcgtaacggc 4380 aaaagcaccg ccggacatca gcgctagcgg agtgtatact ggcttactat gttggcactg 4440 atgagggtgt cagtgaagtg cttcatgtgg caggagaaaa aaggctgcac cggtgcgtca 4500 gcagaatatg tgatacagga tatattccgc ttcctcgctc actgactcgc tacgctcggt 4560 cgttcgactg cggcgagcgg aaatggctta cgaacggggc ggagatttcc tggaagatgc 4620 caggaagata cttaacaggg aagtgagagg gccgcggcaa agccgttttt ccataggctc 4680 cgcccccctg acaagcatca cgaaatctga cgctcaaatc agtggtggcg aaacccgaca 4740 ggactataaa gataccaggc gtttccccct ggcggctccc tcgtgcgctc tcctgttcct 4800 gcctttcggt ttaccggtgt cattccgctg ttatggccgc gtttgtctca ttccacgcct 4860 gacactcagt tccgggtagg cagttcgctc caagctggac tgtatgcacg aaccccccgt 4920 tcagtccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggaaagaca 4980 tgcaaaagca ccactggcag cagccactgg taattgattt agaggagtta gtcttgaagt 5040 catgcgccgg ttaaggctaa actgaaagga caagttttgg tgactgcgct cctccaagcc 5100 agttacctcg gttcaaagag ttggtagctc agagaacctt cgaaaaaccg ccctgcaagg 5160 cggttttttc gttttcagag caagagatta cgcgcagacc aaaacgatct caagaagatc 5220 atcttattaa tcagataaaa tatttctaga tttcagtgca atttatctct tcaaatgtag 5280 cacctgaagt cagccccata cgatataagt tgtaattctc atgtttgaca gcttatcatc 5340 gatacaaatt ccccgtaggc gctagggacc tctttagctt cttggaagct gtcagtagta 5400 tatctaataa tttatctcca ttccctttag taacgtgtaa ctttccaaat ttaaaaaagc 5460 gactcataga attatttcct cccgttaaat aatagataac tattaaaaat agacaatact 5520 tgctcataag taatggtact taaattgttt actttggcgt gtttcattgc ttgatgaaac 5580 tgatttttag taaacagttg acgatattct cgattgaccc attttgaaac aaagtacgta 5640 tatagcttcc aatatttatc tggaacatct gtggtatggc gggtaagttt tattaagaca 5700 ctgtttactt ttggtttagg atgaaagcat tccgctggca gcttaagcaa ttgctgaatc 5760 gagacttgag tgtgcaagag caaccctagt gttcggtgaa tatccaaggt acgcttgtag 5820 aatccttctt caacaatcag atagatgtca gacgcacggc tttcaaaaac cactttttta 5880 ataatttgtg tgcttaaatg gtaaggaata ttcccaacaa ttttatacct ctgtttgtta 5940 gggaattgaa actgtagaat atcttggtga attaaagtga cacgagtatt cagttttaat 6000 ttttctgacg ataagttgaa tagatgactg tctaattcaa tagacgttac ctgtttactt 6060 attttagcca gtttcgtcgt taaatgccct ttacctgttc caatttcgta aacggtatcg 6120 gtttctttta aattcaattg ttttattatt tggttgagta ctttttcact cgttaaaaag 6180 ttttgagaat attttatatt tttgttcatg taatctctcc tgaagtgatt acatctataa 6240 acaaatacag aagttaaacg atttgtttgt aattttagtt atctgtttaa aaagtcataa 6300 gattagtcac tggtaggaat taatctaacg tatttattta tctgcgtaat cactgttttt 6360 agtctgtttc aaaacagtag atgttttatc tacattacgc atttggaata ccaacatgac 6420 gaatccctcc ttcttaatta caaattttta gcatctaatt taacttcaat tcctattata 6480 caaaatttta agataatgca ctatcaacac actcttaagt ttgcttctaa gtcttatttc 6540 cataacttct tttacgtttc cgccattctt tgctgtttcg atttttatga tatggtgcaa 6600 gtcagcacga acacgaaccg tcttatctcc cattatatct ttttttgcac tgattggtgt 6660 atcatttcgt ttttcttttt gtgcgcctaa atttcccaca atcactcact tctttctatt 6720 tcttcttatt cttattttat catcaacaat cacaaatcac ttgtgatttg tgataagtga 6780 tttgtgataa gtgatttgtg attaatatat aaaagccctc tttaaagggc ttttatgttt 6840 attttgagaa agatataaaa tcaatatatc ccttttcccc aatttttaca acggcattgt 6900 agggcttttt ctctttgttt ttgatttcct tttaccaggg tttcttttcc ctccagtaat 6960 tcttttacat tcgttttggt tagttttttc tttctaaaat gttcagctaa agtaaactta 7020 cattcaggat aatttgaaca accataaaac gattttttta atacaatatt gttgccacac 7080 ttaggacatt ttcctacaat acttttttct gcttcttttt ctttctgttc ctggtaatca 7140 gaaaaattta gtttttctat atcgttaggt acagcttcca gtaaatgaac aatgaatttt 7200 ttgatattcg taataaagtt ctcttgattg ccttctcttt taccgatttt ttttaaatac 7260 gtttcccatt tagccgtcat ttcagcactc gttaaaaggt gctgactttc aactgcctgg 7320 cacaataatt ttcctttttc agttacaaca gcctgcaggt cgacgataac cacataacag 7380 tcataaaacc actccttttt aacaaacttt atcacaagaa atattttggc attctacgac 7440 tataacttaa atttatattt tttactttat aatatataat tgattataga ataatgttgc 7500 tcatatcgtt tgccaacatc tagtactcaa attacactat gttacacttg gtaatattaa 7560 ccgaacttcc cctgtccaaa ttagataaga ggtaataata aatggaaaat aattttatag 7620 taaatgaaaa tgagaagcgt gtattaaaac aaattttcaa taacagcaat atttcacgaa 7680 cacaaatatc gaagaattta gaacttaata aagctactat ttctaacatt ctgaacaact 7740 taaaacacaa gagtttagtt aatgaagtag gagaaggtaa tagtactaaa agtggtggac 7800 gaaagcctat tttactcgaa attaaccaaa aatatggcta ctatatttct atggatttaa 7860 catatgattc cgttgaatta atgtacaact actttgatgc tactatatta aagcaagatt 7920 cctacgaatt aaatgataaa aatgtaagca gtatattaca aattttaaaa tctaatataa 7980 acgtctcaga aaaatatgat acgttatatg ggttacttgg tatatctata tccatacacg 8040 gtatcgttga cgatgagcaa aacataatca atcttccttt tcataaaaat gagaaacgca 8100 catttaccga tgaattaaag tcattcacaa atgttcctgt cgttatagaa aatgaagcaa 8160 atttatcagc gctatatgaa aaaagtttat atattaattc aaacataaat aatttgatta 8220 ctttaagtat tcacaagggt ataggcgctg gcatcataat aaataaaaaa ctttatcgtg 8280 gctcaaatgg agaggctgga gagataggta agacattggt tttggaatct ataaataaca 8340 atgacaacaa atattataaa atcgaagata tatgctccca agacgcttta atacagaaaa 8400 taaataatag gttgggcgtc acattgacgt ttacagaact aatccaatat tacaacgaag 8460 gaaattcaat tgttgctcat gaaattaaac aatttattaa taaaatgaca gttctgattc 8520 ataatttgaa tacacaattt aacccagacg ctatttatat taactgtcct ttaattaatg 8580 aattaccaaa tattttaaat gaaattaaag agcaattctc ctgtttttct caaggcagtc 8640 caattcaatt acatttaact actaatgtaa aacaagctac tttattgggt ggcactttag 8700 caataatgca aaaaacatta aatataaata acattcaaat gaatattaaa taattacagc 8760 agtctgagtt ataaaataga tatctcggac cgaaacagcc gaataaggct gtttttcttt 8820 tttttatgtt ttagaataag tggtctagtt tattcttgac aaaaaataat attttgatat 8880 aattaaatag ttgtcgtttg agacgactga ctttcttatt attcatctaa aatattattt 8940 tgaaaagata acacagttat tcttgacaaa aaaatataaa agtgtataat agaaaagtac 9000 tcgagagtta gtttgtttat taaattaacc aactggacga attcccgggg atccgtcgac 9060 ctgcagccaa gcttgggctt ttcagcctga tacagattaa atcagaacgc agaagcggtc 9120 tgataaaaca gaatttgcct ggcggcagta gcgcggtggt cccacctgac cccatgccga 9180 actcagaagt gaaacgccgt agcgccgatg gtagtgtggg gtctccccat gcgagagtag 9240 ggaactgcca ggcatcaaat aaaacgaaag gctcagtcga aagactgggc ctttcgtttt 9300 atctgttgtt tgtcggtgaa cgctctcctg agtaggacaa atccgccggg agcggatttg 9360 aacgttgcga agcaacggcc cggagggtgg cgggcaggac gcccgccata aactgccagg 9420 catcaaatta agcagaaggc catcctgacg gatggccttt ttgcgtttct acaaactctt 9480 tttgtttatt tttctaaata cattcaaata tgtatccgct gatccccatc ctatcgatcc 9540 ccacccgatg cccttgagag ccttca 9566 51 9412 DNA Artificial Sequence pEPEF20 51 acccgatgcc cttgagagcc ttcaacccag tcagctcctt ccggtgggcg cggggcatga 60 ctatcgtcgc cgcacttatg actgtcttct ttatcatgca actcgtagga caggtgccgg 120 cagcgctctg ggtcattttc ggcgaggacc gctttcgctg gagcgcgacg atgatcggcc 180 tgtcgcttgc ggtattcgga atcttgcacg ccctcgctca agccttcgtc actggtcccg 240 ccaccaaacg tttcggcgag aagcaggcca ttatcgccgg catggcggcc gacgcgctgg 300 gctacgtctt gctggcgttc gcgacgcgag gctggatggc cttccccatt atgattcttc 360 tcgcttccgg cggcatcggg atgcccgcgt tgcaggccat gctgtccagg caggtagatg 420 acgaccatca gggacagctt caaggatcgc tcgcggctct taccagccta acttcgatca 480 ctggaccgct gatcgtcacg gcgatttatg ccgcctcggc gagcacatgg aacgggttgg 540 catggattgt aggcgccgcc ctataccttg tctgcctccc cgcgttgcgt cgcggtgcat 600 ggagccgggc cacctcgacc tgaatggaag ccggcggcac ctcgctaacg gattcaccac 660 tccaagaatt ggagccaatc aattcttgcg gagaactgtg aatgcgcaaa ccaacccttg 720 gcagaacata tccatcgcgt ccgccatctc cagcagccgc acgcggcgca tctcgggcag 780 cgttgggtcc tggccacggg tgcgcatgat cgtgctcctg tcgttgagga cccggctagg 840 ctggcggggt tgccttactg gttagcagaa tgaatcaccg atacgcgagc gaacgtgaag 900 cgactgctgc tgcaaaacgt ctgcgacctg agcaacaaca tgaatggtct tcggtttccg 960 tgtttcgtaa agtctggaaa cgcggaagtc ccctacgtgc tgctgaagtt gcccgcaaca 1020 gagagtggaa ccaaccggtg ataccacgat actatgactg agagtcaacg ccatgagcgg 1080 cctcatttct tattctgagt tacaacagtc cgcaccgctg tccggtagct ccttccggtg 1140 ggcgcggggc atgactatcg tcgccgcact tatgactgtc ttctttatca tgcaactcgt 1200 aggacaggtg ccggcagcgc ccaacagtcc cccggccacg gggcctgcca ccatacccac 1260 gccgaaacaa gcgccctgca ccattatgtt ccggatctgc atcgcaggat gctgctggct 1320 accctgtgga acacctacat ctgtattaac gaagcgctaa ccgtttttat caggctctgg 1380 gaggcagaat aaatgatcat atcgtcaatt attacctcca cggggagagc ctgagcaaac 1440 tggcctcagg catttgagaa gcacacggtc acactgcttc cggtagtcaa taaaccggta 1500 aaccagcaat agacataagc ggctatttaa cgaccctgcc ctgaaccgac gaccgggtcg 1560 aatttgcttt cgaatttctg ccattcatcc gcttattatc acttattcag gcgtagcacc 1620 aggcgtttaa gggcaccaat aactgcctta aaaaaattac gccccgccct gccactcatc 1680 gcagtactgt tgtaattcat taagcattct gccgacatgg aagccatcac agacggcatg 1740 atgaacctga atcgccagcg gcatcagcac cttgtcgcct tgcgtataat atttgcccat 1800 ggtgaaaacg ggggcgaaga agttgtccat attggccacg tttaaatcaa aactggtgaa 1860 actcacccag ggattggctg agacgaaaaa catattctca ataaaccctt tagggaaata 1920 ggccaggttt tcaccgtaac acgccacatc ttgcgaatat atgtgtagaa actgccggaa 1980 atcgtcgtgg tattcactcc agagcgatga aaacgtttca gtttgctcat ggaaaacggt 2040 gtaacaaggg tgaacactat cccatatcac cagctcaccg tctttcattg ccatacgaga 2100 ggtttgatga ctttgacctt tggttttaaa tttttgaaaa aaataaaaaa aggcgaagcc 2160 tattatatat ttatcatata tattttaatc ttttgttctt ttgcgtgaaa aaaaaggcag 2220 tgttttcgct agttatagaa attaaacagt cacaaaaatc gatgtataca gtcacaaaaa 2280 tcgatttttg tgactctatg catcgatgta tacgagtcac aaaaatcgat gtacacagta 2340 cgacttttgt atttgtgtac tgtatatagt ataataaaag catagagaaa actcactatg 2400 aaatgacttt ctctatgcta ctactaaaac acgcaaagga gcgtatttat actatgatta 2460 taccagaaaa acaaaacaaa caaaagcagg tgctgacctt gaatgaatta gaaaaacgca 2520 aagtagtgga gcataacgct ttaattcaaa gtgtggctaa aatgcaaaaa acagctttaa 2580 aaatgtttga gttagccgtg tcttgtattg ataccgaaga acccccaaaa aataatactg 2640 tttatctttt aaaatctgaa ctttttaaat tttttgaagt atcgtctagc agtaagcata 2700 gtcaattcaa agaagcagtt aattatatgc aaaagcaagc ctttttcaat attaaagctg 2760 ataagaaact aggtattgag tatgaaagca ttgtgcctat tccttatgta aaatggaacg 2820 attataacga tgaagtaaca atacgttttg accaagccat tatgccctat ctgattgatt 2880 taaaggcaga atttactcaa tataaaattt ccgaactaca aaaactcaat agcaagtatt 2940 cgattatttt gtatcgttgg ttatccatga attataacca atacgaacat tacagcgtga 3000 aagggggacg gagagttgaa caagtggaat cctaccgcaa cccctcaata aaagtgaaag 3060 aattgcgaga aataactgat acaataaatg aacatcaaca ttttccccat tttgaaacta 3120 gagtattaaa aaaagcaatt gaagaaatca acgctcacac ctcttttaat gtaacctatg 3180 agaaaaagaa agcagggcgg agcattgact ctattgtctt tcatattgag aagaaacgca 3240 tggcagacga taacagttac aagttgggag ataaagacta tcaagacgat aaaaagcaaa 3300 aatcaagaaa tgaagctgac ttactaaaac aggcaatgga aagtaaatat acaagacttt 3360 tatctgaaaa tttcttaatt ggcatgaatg atattatgga tacagccaca atggtaggct 3420 tacagaagaa tgtttatccg ctttatgacg aactgaaaga attaagaggg ctaaatggtg 3480 tcaaagacca cttgtcttat gttgccagca aaaaagaagc ctattctaaa cgtaatgtag 3540 cgaaatatct taaaaaagcc attgaacaat atctgccaac ggttaaaagg caggacttaa 3600 accatgagtg aagacttaaa aacgataaaa gagttggcgg acgagctagg actagctaaa 3660 cagaccgttc aataccattt aaagtttttg ccgaccgaaa atcgacaaaa aaacaatagt 3720 ggttcgatcg tcctaaacct tgaagaacaa aagtttatca aaaaaagact aacaaaaata 3780 aaaaaagttc aaccgaccga aaatcgacaa gaaaaaatga agaattaagc gttaatagtc 3840 ttttgttagg cggatgagca ttcatcaggc gggcaagaat gtgaataaag gccggataaa 3900 acttgtgctt atttttcttt acggtcttta aaaaggccgt aatatccagc tgaacggtct 3960 ggttataggt acattgagca actgactgaa atgcctcaaa atgttcttta cgatgccatt 4020 gggatatatc aacggtggta tatccagtga tttttttctc cattttagct tccttagctc 4080 ctgaaaatct cgataactca aaaaatacgc ccggtagtga tcttatttca ttatggtgaa 4140 agttggaacc tcttacgtgc cgatcaacgt ctcattttcg ccaaaagttg gcccagggct 4200 tcccggtatc aacagggaca ccaggattta tttattctgc gaagtgatct tccgtcacag 4260 gtatttattc ggcgcaaagt gcgtcgggtg atgctgccaa cttactgatt tagtgtatga 4320 tggtgttttt gaggtgctcc agtggcttct gtttctatca gctgtccctc ctgttcagct 4380 actgacgggg tggtgcgtaa cggcaaaagc accgccggac atcagcgcta gcggagtgta 4440 tactggctta ctatgttggc actgatgagg gtgtcagtga agtgcttcat gtggcaggag 4500 aaaaaaggct gcaccggtgc gtcagcagaa tatgtgatac aggatatatt ccgcttcctc 4560 gctcactgac tcgctacgct cggtcgttcg actgcggcga gcggaaatgg cttacgaacg 4620 gggcggagat ttcctggaag atgccaggaa gatacttaac agggaagtga gagggccgcg 4680 gcaaagccgt ttttccatag gctccgcccc cctgacaagc atcacgaaat ctgacgctca 4740 aatcagtggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc ccctggcggc 4800 tccctcgtgc gctctcctgt tcctgccttt cggtttaccg gtgtcattcc gctgttatgg 4860 ccgcgtttgt ctcattccac gcctgacact cagttccggg taggcagttc gctccaagct 4920 ggactgtatg cacgaacccc ccgttcagtc cgaccgctgc gccttatccg gtaactatcg 4980 tcttgagtcc aacccggaaa gacatgcaaa agcaccactg gcagcagcca ctggtaattg 5040 atttagagga gttagtcttg aagtcatgcg ccggttaagg ctaaactgaa aggacaagtt 5100 ttggtgactg cgctcctcca agccagttac ctcggttcaa agagttggta gctcagagaa 5160 ccttcgaaaa accgccctgc aaggcggttt tttcgttttc agagcaagag attacgcgca 5220 gaccaaaacg atctcaagaa gatcatctta ttaatcagat aaaatatttc tagatttcag 5280 tgcaatttat ctcttcaaat gtagcacctg aagtcagccc catacgatat aagttgtaat 5340 tctcatgttt gacagcttat catcgataca aattccccgt aggcgctagg gacctcttta 5400 gcttcttgga agctgtcagt agtatatcta ataatttatc tccattccct ttagtaacgt 5460 gtaactttcc aaatttaaaa aagcgactca tagaattatt tcctcccgtt aaataataga 5520 taactattaa aaatagacaa tacttgctca taagtaatgg tacttaaatt gtttactttg 5580 gcgtgtttca ttgcttgatg aaactgattt ttagtaaaca gttgacgata ttctcgattg 5640 acccattttg aaacaaagta cgtatatagc ttccaatatt tatctggaac atctgtggta 5700 tggcgggtaa gttttattaa gacactgttt acttttggtt taggatgaaa gcattccgct 5760 ggcagcttaa gcaattgctg aatcgagact tgagtgtgca agagcaaccc tagtgttcgg 5820 tgaatatcca aggtacgctt gtagaatcct tcttcaacaa tcagatagat gtcagacgca 5880 cggctttcaa aaaccacttt tttaataatt tgtgtgctta aatggtaagg aatattccca 5940 acaattttat acctctgttt gttagggaat tgaaactgta gaatatcttg gtgaattaaa 6000 gtgacacgag tattcagttt taatttttct gacgataagt tgaatagatg actgtctaat 6060 tcaatagacg ttacctgttt acttatttta gccagtttcg tcgttaaatg ccctttacct 6120 gttccaattt cgtaaacggt atcggtttct tttaaattca attgttttat tatttggttg 6180 agtacttttt cactcgttaa aaagttttga gaatatttta tatttttgtt catgtaatct 6240 ctcctgaagt gattacatct ataaacaaat acagaagtta aacgatttgt ttgtaatttt 6300 agttatctgt ttaaaaagtc ataagattag tcactggtag gaattaatct aacgtattta 6360 tttatctgcg taatcactgt ttttagtctg tttcaaaaca gtagatgttt tatctacatt 6420 acgcatttgg aataccaaca tgacgaatcc ctccttctta attacaaatt tttagcatct 6480 aatttaactt caattcctat tatacaaaat tttaagataa tgcactatca acacactctt 6540 aagtttgctt ctaagtctta tttccataac ttcttttacg tttccgccat tctttgctgt 6600 ttcgattttt atgatatggt gcaagtcagc acgaacacga accgtcttat ctcccattat 6660 atcttttttt gcactgattg gtgtatcatt tcgtttttct ttttgtgcgc ctaaatttcc 6720 cacaatcact cacttctttc tatttcttct tattcttatt ttatcatcaa caatcacaaa 6780 tcacttgtga tttgtgataa gtgatttgtg ataagtgatt tgtgattaat atataaaagc 6840 cctctttaaa gggcttttat gtttattttg agaaagatat aaaatcaata tatccctttt 6900 ccccaatttt tacaacggca ttgtagggct ttttctcttt gtttttgatt tccttttacc 6960 agggtttctt ttccctccag taattctttt acattcgttt tggttagttt tttctttcta 7020 aaatgttcag ctaaagtaaa cttacattca ggataatttg aacaaccata aaacgatttt 7080 tttaatacaa tattgttgcc acacttagga cattttccta caatactttt ttctgcttct 7140 ttttctttct gttcctggta atcagaaaaa tttagttttt ctatatcgtt aggtacagct 7200 tccagtaaat gaacaatgaa ttttttgata ttcgtaataa agttctcttg attgccttct 7260 cttttaccga ttttttttaa atacgtttcc catttagccg tcatttcagc actcgttaaa 7320 aggtgctgac tttcaactgc ctggcacaat aattttcctt tttcagttac aacagcctgc 7380 aggtcgacga taaccacata acagtcataa aaccactcct ttttaacaaa ctttatcaca 7440 agaaatattt tggcattcta cgactataac ttaaatttat attttttact ttataatata 7500 taattgatta tagaataatg ttgctcatat cgtttgccaa catctagtac tcaaattaca 7560 ctatgttaca cttggtaata ttaaccgaac ttcccctgtc caaattagat aagaggtaat 7620 aataaatgga aaataatttt atagtaaatg aaaatgagaa gcgtgtatta aaacaaattt 7680 tcaataacag caatatttca cgaacacaaa tatcgaagaa tttagaactt aataaagcta 7740 ctatttctaa cattctgaac aacttaaaac acaagagttt agttaatgaa gtaggagaag 7800 gtaatagtac taaaagtggt ggacgaaagc ctattttact cgaaattaac caaaaatatg 7860 gctactatat ttctatggat ttaacatatg attccgttga attaatgtac aactactttg 7920 atgctactat attaaagcaa gattcctacg aattaaatga taaaaatgta agcagtatat 7980 tacaaatttt aaaatctaat ataaacgtct cagaaaaata tgatacgtta tatgggttac 8040 ttggtatatc tatatccata cacggtatcg ttgacgatga gcaaaacata atcaatcttc 8100 cttttcataa aaatgagaaa cgcacattta ccgatgaatt aaagtcattc acaaatgttc 8160 ctgtcgttat agaaaatgaa gcaaatttat cagcgctata tgaaaaaagt ttatatatta 8220 attcaaacat aaataatttg attactttaa gtattcacaa gggtataggc gctggcatca 8280 taataaataa aaaactttat cgtggctcaa atggagaggc tggagagata ggtaagacat 8340 tggttttgga atctataaat aacaatgaca acaaatatta taaaatcgaa gatatatgct 8400 cccaagacgc tttaatacag aaaataaata ataggttggg cgtcacattg acgtttacag 8460 aactaatcca atattacaac gaaggaaatt caattgttgc tcatgaaatt aaacaattta 8520 ttaataaaat gacagttctg attcataatt tgaatacaca atttaaccca gacgctattt 8580 atattaactg tcctttaatt aatgaattac caaatatttt aaatgaaatt aaagagcaat 8640 tctcctgttt ttctcaaggc agtccaattc aattacattt aactactaat gtaaaacaag 8700 ctactttatt gggtggcact ttagcaataa tgcaaaaaac attaaatata aataacattc 8760 aaatgaatat taaataatta cagcagtctg agttataaaa tagatatctc ggaccgtttc 8820 gtgaactttt ttgttgacaa agataaaaac acatgatata cttaaatcac tcgagagtta 8880 gtttgtttat taaattaacc aactggacga attcccgggg atccgtcgac ctgcagccaa 8940 gcttgggctt ttcagcctga tacagattaa atcagaacgc agaagcggtc tgataaaaca 9000 gaatttgcct ggcggcagta gcgcggtggt cccacctgac cccatgccga actcagaagt 9060 gaaacgccgt agcgccgatg gtagtgtggg gtctccccat gcgagagtag ggaactgcca 9120 ggcatcaaat aaaacgaaag gctcagtcga aagactgggc ctttcgtttt atctgttgtt 9180 tgtcggtgaa cgctctcctg agtaggacaa atccgccggg agcggatttg aacgttgcga 9240 agcaacggcc cggagggtgg cgggcaggac gcccgccata aactgccagg catcaaatta 9300 agcagaaggc catcctgacg gatggccttt ttgcgtttct acaaactctt tttgtttatt 9360 tttctaaata cattcaaata tgtatccgct gatccccatc ctatcgatcc cc 9412 52 9412 DNA Artificial Sequence pEPEF22 52 acccgatgcc cttgagagcc ttcaacccag tcagctcctt ccggtgggcg cggggcatga 60 ctatcgtcgc cgcacttatg actgtcttct ttatcatgca actcgtagga caggtgccgg 120 cagcgctctg ggtcattttc ggcgaggacc gctttcgctg gagcgcgacg atgatcggcc 180 tgtcgcttgc ggtattcgga atcttgcacg ccctcgctca agccttcgtc actggtcccg 240 ccaccaaacg tttcggcgag aagcaggcca ttatcgccgg catggcggcc gacgcgctgg 300 gctacgtctt gctggcgttc gcgacgcgag gctggatggc cttccccatt atgattcttc 360 tcgcttccgg cggcatcggg atgcccgcgt tgcaggccat gctgtccagg caggtagatg 420 acgaccatca gggacagctt caaggatcgc tcgcggctct taccagccta acttcgatca 480 ctggaccgct gatcgtcacg gcgatttatg ccgcctcggc gagcacatgg aacgggttgg 540 catggattgt aggcgccgcc ctataccttg tctgcctccc cgcgttgcgt cgcggtgcat 600 ggagccgggc cacctcgacc tgaatggaag ccggcggcac ctcgctaacg gattcaccac 660 tccaagaatt ggagccaatc aattcttgcg gagaactgtg aatgcgcaaa ccaacccttg 720 gcagaacata tccatcgcgt ccgccatctc cagcagccgc acgcggcgca tctcgggcag 780 cgttgggtcc tggccacggg tgcgcatgat cgtgctcctg tcgttgagga cccggctagg 840 ctggcggggt tgccttactg gttagcagaa tgaatcaccg atacgcgagc gaacgtgaag 900 cgactgctgc tgcaaaacgt ctgcgacctg agcaacaaca tgaatggtct tcggtttccg 960 tgtttcgtaa agtctggaaa cgcggaagtc ccctacgtgc tgctgaagtt gcccgcaaca 1020 gagagtggaa ccaaccggtg ataccacgat actatgactg agagtcaacg ccatgagcgg 1080 cctcatttct tattctgagt tacaacagtc cgcaccgctg tccggtagct ccttccggtg 1140 ggcgcggggc atgactatcg tcgccgcact tatgactgtc ttctttatca tgcaactcgt 1200 aggacaggtg ccggcagcgc ccaacagtcc cccggccacg gggcctgcca ccatacccac 1260 gccgaaacaa gcgccctgca ccattatgtt ccggatctgc atcgcaggat gctgctggct 1320 accctgtgga acacctacat ctgtattaac gaagcgctaa ccgtttttat caggctctgg 1380 gaggcagaat aaatgatcat atcgtcaatt attacctcca cggggagagc ctgagcaaac 1440 tggcctcagg catttgagaa gcacacggtc acactgcttc cggtagtcaa taaaccggta 1500 aaccagcaat agacataagc ggctatttaa cgaccctgcc ctgaaccgac gaccgggtcg 1560 aatttgcttt cgaatttctg ccattcatcc gcttattatc acttattcag gcgtagcacc 1620 aggcgtttaa gggcaccaat aactgcctta aaaaaattac gccccgccct gccactcatc 1680 gcagtactgt tgtaattcat taagcattct gccgacatgg aagccatcac agacggcatg 1740 atgaacctga atcgccagcg gcatcagcac cttgtcgcct tgcgtataat atttgcccat 1800 ggtgaaaacg ggggcgaaga agttgtccat attggccacg tttaaatcaa aactggtgaa 1860 actcacccag ggattggctg agacgaaaaa catattctca ataaaccctt tagggaaata 1920 ggccaggttt tcaccgtaac acgccacatc ttgcgaatat atgtgtagaa actgccggaa 1980 atcgtcgtgg tattcactcc agagcgatga aaacgtttca gtttgctcat ggaaaacggt 2040 gtaacaaggg tgaacactat cccatatcac cagctcaccg tctttcattg ccatacgaga 2100 ggtttgatga ctttgacctt tggttttaaa tttttgaaaa aaataaaaaa aggcgaagcc 2160 tattatatat ttatcatata tattttaatc ttttgttctt ttgcgtgaaa aaaaaggcag 2220 tgttttcgct agttatagaa attaaacagt cacaaaaatc gatgtataca gtcacaaaaa 2280 tcgatttttg tgactctatg catcgatgta tacgagtcac aaaaatcgat gtacacagta 2340 cgacttttgt atttgtgtac tgtatatagt ataataaaag catagagaaa actcactatg 2400 aaatgacttt ctctatgcta ctactaaaac acgcaaagga gcgtatttat actatgatta 2460 taccagaaaa acaaaacaaa caaaagcagg tgctgacctt gaatgaatta gaaaaacgca 2520 aagtagtgga gcataacgct ttaattcaaa gtgtggctaa aatgcaaaaa acagctttaa 2580 aaatgtttga gttagccgtg tcttgtattg ataccgaaga acccccaaaa aataatactg 2640 tttatctttt aaaatctgaa ctttttaaat tttttgaagt atcgtctagc agtaagcata 2700 gtcaattcaa agaagcagtt aattatatgc aaaagcaagc ctttttcaat attaaagctg 2760 ataagaaact aggtattgag tatgaaagca ttgtgcctat tccttatgta aaatggaacg 2820 attataacga tgaagtaaca atacgttttg accaagccat tatgccctat ctgattgatt 2880 taaaggcaga atttactcaa tataaaattt ccgaactaca aaaactcaat agcaagtatt 2940 cgattatttt gtatcgttgg ttatccatga attataacca atacgaacat tacagcgtga 3000 aagggggacg gagagttgaa caagtggaat cctaccgcaa cccctcaata aaagtgaaag 3060 aattgcgaga aataactgat acaataaatg aacatcaaca ttttccccat tttgaaacta 3120 gagtattaaa aaaagcaatt gaagaaatca acgctcacac ctcttttaat gtaacctatg 3180 agaaaaagaa agcagggcgg agcattgact ctattgtctt tcatattgag aagaaacgca 3240 tggcagacga taacagttac aagttgggag ataaagacta tcaagacgat aaaaagcaaa 3300 aatcaagaaa tgaagctgac ttactaaaac aggcaatgga aagtaaatat acaagacttt 3360 tatctgaaaa tttcttaatt ggcatgaatg atattatgga tacagccaca atggtaggct 3420 tacagaagaa tgtttatccg ctttatgacg aactgaaaga attaagaggg ctaaatggtg 3480 tcaaagacca cttgtcttat gttgccagca aaaaagaagc ctattctaaa cgtaatgtag 3540 cgaaatatct taaaaaagcc attgaacaat atctgccaac ggttaaaagg caggacttaa 3600 accatgagtg aagacttaaa aacgataaaa gagttggcgg acgagctagg actagctaaa 3660 cagaccgttc aataccattt aaagtttttg ccgaccgaaa atcgacaaaa aaacaatagt 3720 ggttcgatcg tcctaaacct tgaagaacaa aagtttatca aaaaaagact aacaaaaata 3780 aaaaaagttc aaccgaccga aaatcgacaa gaaaaaatga agaattaagc gttaatagtc 3840 ttttgttagg cggatgagca ttcatcaggc gggcaagaat gtgaataaag gccggataaa 3900 acttgtgctt atttttcttt acggtcttta aaaaggccgt aatatccagc tgaacggtct 3960 ggttataggt acattgagca actgactgaa atgcctcaaa atgttcttta cgatgccatt 4020 gggatatatc aacggtggta tatccagtga tttttttctc cattttagct tccttagctc 4080 ctgaaaatct cgataactca aaaaatacgc ccggtagtga tcttatttca ttatggtgaa 4140 agttggaacc tcttacgtgc cgatcaacgt ctcattttcg ccaaaagttg gcccagggct 4200 tcccggtatc aacagggaca ccaggattta tttattctgc gaagtgatct tccgtcacag 4260 gtatttattc ggcgcaaagt gcgtcgggtg atgctgccaa cttactgatt tagtgtatga 4320 tggtgttttt gaggtgctcc agtggcttct gtttctatca gctgtccctc ctgttcagct 4380 actgacgggg tggtgcgtaa cggcaaaagc accgccggac atcagcgcta gcggagtgta 4440 tactggctta ctatgttggc actgatgagg gtgtcagtga agtgcttcat gtggcaggag 4500 aaaaaaggct gcaccggtgc gtcagcagaa tatgtgatac aggatatatt ccgcttcctc 4560 gctcactgac tcgctacgct cggtcgttcg actgcggcga gcggaaatgg cttacgaacg 4620 gggcggagat ttcctggaag atgccaggaa gatacttaac agggaagtga gagggccgcg 4680 gcaaagccgt ttttccatag gctccgcccc cctgacaagc atcacgaaat ctgacgctca 4740 aatcagtggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc ccctggcggc 4800 tccctcgtgc gctctcctgt tcctgccttt cggtttaccg gtgtcattcc gctgttatgg 4860 ccgcgtttgt ctcattccac gcctgacact cagttccggg taggcagttc gctccaagct 4920 ggactgtatg cacgaacccc ccgttcagtc cgaccgctgc gccttatccg gtaactatcg 4980 tcttgagtcc aacccggaaa gacatgcaaa agcaccactg gcagcagcca ctggtaattg 5040 atttagagga gttagtcttg aagtcatgcg ccggttaagg ctaaactgaa aggacaagtt 5100 ttggtgactg cgctcctcca agccagttac ctcggttcaa agagttggta gctcagagaa 5160 ccttcgaaaa accgccctgc aaggcggttt tttcgttttc agagcaagag attacgcgca 5220 gaccaaaacg atctcaagaa gatcatctta ttaatcagat aaaatatttc tagatttcag 5280 tgcaatttat ctcttcaaat gtagcacctg aagtcagccc catacgatat aagttgtaat 5340 tctcatgttt gacagcttat catcgataca aattccccgt aggcgctagg gacctcttta 5400 gcttcttgga agctgtcagt agtatatcta ataatttatc tccattccct ttagtaacgt 5460 gtaactttcc aaatttaaaa aagcgactca tagaattatt tcctcccgtt aaataataga 5520 taactattaa aaatagacaa tacttgctca taagtaatgg tacttaaatt gtttactttg 5580 gcgtgtttca ttgcttgatg aaactgattt ttagtaaaca gttgacgata ttctcgattg 5640 acccattttg aaacaaagta cgtatatagc ttccaatatt tatctggaac atctgtggta 5700 tggcgggtaa gttttattaa gacactgttt acttttggtt taggatgaaa gcattccgct 5760 ggcagcttaa gcaattgctg aatcgagact tgagtgtgca agagcaaccc tagtgttcgg 5820 tgaatatcca aggtacgctt gtagaatcct tcttcaacaa tcagatagat gtcagacgca 5880 cggctttcaa aaaccacttt tttaataatt tgtgtgctta aatggtaagg aatattccca 5940 acaattttat acctctgttt gttagggaat tgaaactgta gaatatcttg gtgaattaaa 6000 gtgacacgag tattcagttt taatttttct gacgataagt tgaatagatg actgtctaat 6060 tcaatagacg ttacctgttt acttatttta gccagtttcg tcgttaaatg ccctttacct 6120 gttccaattt cgtaaacggt atcggtttct tttaaattca attgttttat tatttggttg 6180 agtacttttt cactcgttaa aaagttttga gaatatttta tatttttgtt catgtaatct 6240 ctcctgaagt gattacatct ataaacaaat acagaagtta aacgatttgt ttgtaatttt 6300 agttatctgt ttaaaaagtc ataagattag tcactggtag gaattaatct aacgtattta 6360 tttatctgcg taatcactgt ttttagtctg tttcaaaaca gtagatgttt tatctacatt 6420 acgcatttgg aataccaaca tgacgaatcc ctccttctta attacaaatt tttagcatct 6480 aatttaactt caattcctat tatacaaaat tttaagataa tgcactatca acacactctt 6540 aagtttgctt ctaagtctta tttccataac ttcttttacg tttccgccat tctttgctgt 6600 ttcgattttt atgatatggt gcaagtcagc acgaacacga accgtcttat ctcccattat 6660 atcttttttt gcactgattg gtgtatcatt tcgtttttct ttttgtgcgc ctaaatttcc 6720 cacaatcact cacttctttc tatttcttct tattcttatt ttatcatcaa caatcacaaa 6780 tcacttgtga tttgtgataa gtgatttgtg ataagtgatt tgtgattaat atataaaagc 6840 cctctttaaa gggcttttat gtttattttg agaaagatat aaaatcaata tatccctttt 6900 ccccaatttt tacaacggca ttgtagggct ttttctcttt gtttttgatt tccttttacc 6960 agggtttctt ttccctccag taattctttt acattcgttt tggttagttt tttctttcta 7020 aaatgttcag ctaaagtaaa cttacattca ggataatttg aacaaccata aaacgatttt 7080 tttaatacaa tattgttgcc acacttagga cattttccta caatactttt ttctgcttct 7140 ttttctttct gttcctggta atcagaaaaa tttagttttt ctatatcgtt aggtacagct 7200 tccagtaaat gaacaatgaa ttttttgata ttcgtaataa agttctcttg attgccttct 7260 cttttaccga ttttttttaa atacgtttcc catttagccg tcatttcagc actcgttaaa 7320 aggtgctgac tttcaactgc ctggcacaat aattttcctt tttcagttac aacagcctgc 7380 aggtcgacga taaccacata acagtcataa aaccactcct ttttaacaaa ctttatcaca 7440 agaaatattt tggcattcta cgactataac ttaaatttat attttttact ttataatata 7500 taattgatta tagaataatg ttgctcatat cgtttgccaa catctagtac tcaaattaca 7560 ctatgttaca cttggtaata ttaaccgaac ttcccctgtc caaattagat aagaggtaat 7620 aataaatgga aaataatttt atagtaaatg aaaatgagaa gcgtgtatta aaacaaattt 7680 tcaataacag caatatttca cgaacacaaa tatcgaagaa tttagaactt aataaagcta 7740 ctatttctaa cattctgaac aacttaaaac acaagagttt agttaatgaa gtaggagaag 7800 gtaatagtac taaaagtggt ggacgaaagc ctattttact cgaaattaac caaaaatatg 7860 gctactatat ttctatggat ttaacatatg attccgttga attaatgtac aactactttg 7920 atgctactat attaaagcaa gattcctacg aattaaatga taaaaatgta agcagtatat 7980 tacaaatttt aaaatctaat ataaacgtct cagaaaaata tgatacgtta tatgggttac 8040 ttggtatatc tatatccata cacggtatcg ttgacgatga gcaaaacata atcaatcttc 8100 cttttcataa aaatgagaaa cgcacattta ccgatgaatt aaagtcattc acaaatgttc 8160 ctgtcgttat agaaaatgaa gcaaatttat cagcgctata tgaaaaaagt ttatatatta 8220 attcaaacat aaataatttg attactttaa gtattcacaa gggtataggc gctggcatca 8280 taataaataa aaaactttat cgtggctcaa atggagaggc tggagagata ggtaagacat 8340 tggttttgga atctataaat aacaatgaca acaaatatta taaaatcgaa gatatatgct 8400 cccaagacgc tttaatacag aaaataaata ataggttggg cgtcacattg acgtttacag 8460 aactaatcca atattacaac gaaggaaatt caattgttgc tcatgaaatt aaacaattta 8520 ttaataaaat gacagttctg attcataatt tgaatacaca atttaaccca gacgctattt 8580 atattaactg tcctttaatt aatgaattac caaatatttt aaatgaaatt aaagagcaat 8640 tctcctgttt ttctcaaggc agtccaattc aattacattt aactactaat gtaaaacaag 8700 ctactttatt gggtggcact ttagcaataa tgcaaaaaac attaaatata aataacattc 8760 aaatgaatat taaataatta cagcagtctg agttataaaa tagatatctc ggaccgtttc 8820 gtgaactttt ttgttgacaa agataaaaac acatgatata attaaatcac tcgagagtta 8880 gtttgtttat taaattaacc aactggacga attcccgggg atccgtcgac ctgcagccaa 8940 gcttgggctt ttcagcctga tacagattaa atcagaacgc agaagcggtc tgataaaaca 9000 gaatttgcct ggcggcagta gcgcggtggt cccacctgac cccatgccga actcagaagt 9060 gaaacgccgt agcgccgatg gtagtgtggg gtctccccat gcgagagtag ggaactgcca 9120 ggcatcaaat aaaacgaaag gctcagtcga aagactgggc ctttcgtttt atctgttgtt 9180 tgtcggtgaa cgctctcctg agtaggacaa atccgccggg agcggatttg aacgttgcga 9240 agcaacggcc cggagggtgg cgggcaggac gcccgccata aactgccagg catcaaatta 9300 agcagaaggc catcctgacg gatggccttt ttgcgtttct acaaactctt tttgtttatt 9360 tttctaaata cattcaaata tgtatccgct gatccccatc ctatcgatcc cc 9412 53 6852 DNA Artificial Sequence pXyl-T5-DD p15a 53 tggcattcta cgactataac ttaaatttat attttttact ttataatata taattgatta 60 tagaataatg ttgctcatat cgtttgccaa catctagtac tcaaattaca ctatgttaca 120 cttggtaata ttaaccgaac ttcccctgtc caaattagat aagaggtaat aataaatgga 180 aaataatttt atagtaaatg aaaatgagaa gcgtgtatta aaacaaattt tcaataacag 240 caatatttca cgaacacaaa tatcgaagaa tttagaactt aataaagcta ctatttctaa 300 cattctgaac aacttaaaac acaagagttt agttaatgaa gtaggagaag gtaatagtac 360 taaaagtggt ggacgaaagc ctattttact cgaaattaac caaaaatatg gctactatat 420 ttctatggat ttaacatatg attccgttga attaatgtac aactactttg atgctactat 480 attaaagcaa gattcctacg aattaaatga taaaaatgta agcagtatat tacaaatttt 540 aaaatctaat ataaacgtct cagaaaaata tgatacgtta tatgggttac ttggtatatc 600 tatatccata cacggtatcg ttgacgatga gcaaaacata atcaatcttc cttttcataa 660 aaatgagaaa cgcacattta ccgatgaatt aaagtcattc acaaatgttc ctgtcgttat 720 agaaaatgaa gcaaatttat cagcgctata tgaaaaaagt ttatatatta attcaaacat 780 aaataatttg attactttaa gtattcacaa gggtataggc gctggcatca taataaataa 840 aaaactttat cgtggctcaa atggagaggc tggagagata ggtaagacat tggttttgga 900 atctataaat aacaatgaca acaaatatta taaaatcgaa gatatatgct cccaagacgc 960 tttaatacag aaaataaata ataggttggg cgtcacattg acgtttacag aactaatcca 1020 atattacaac gaaggaaatt caattgttgc tcatgaaatt aaacaattta ttaataaaat 1080 gacagttctg attcataatt tgaatacaca atttaaccca gacgctattt atattaactg 1140 tcctttaatt aatgaattac caaatatttt aaatgaaatt aaagagcaat tctcctgttt 1200 ttctcaaggc agtccaattc aattacattt aactactaat gtaaaacaag ctactttatt 1260 gggtggcact ttagcaataa tgcaaaaaac attaaatata aataacattc aaatgaatat 1320 taaataatta cagcagtctg agttataaaa tagatatctc ggaccgtcat aaaaaattta 1380 ttttacatca ggaaaatttt tctgtatatt agattcaagt tagtttgttt attaaattaa 1440 ccaactaaaa tgtagaattc gagctcggta cccggggatc ctctagagtc gacctgcagc 1500 caagcttggg cttttcagcc tgatacagat taaatcagaa cgcagaagcg gtctgataaa 1560 acagaatttg cctggcggca gtagcgcggt ggtcccacct gaccccatgc cgaactcaga 1620 agtgaaacgc cgtagcgccg atggtagtgt ggggtctccc catgcgagag tagggaactg 1680 ccaggcatca aataaaacga aaggctcagt cgaaagactg ggcctttcgt tttatctgtt 1740 gtttgtcggt gaacgctctc ctgagtagga caaatccgcc gggagcggat ttgaacgttg 1800 cgaagcaacg gcccggaggg tggcgggcag gacgcccgcc ataaactgcc aggcatcaaa 1860 ttaagcagaa ggccatcctg acggatggcc tttttgcgtt tctacaaact ctttttgttt 1920 atttttctaa atacattcaa atatgtatcc gctgatcccc atcctatcga tgataagctg 1980 tcaaacatga gaattaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 2040 caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 2100 gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt 2160 gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag 2220 ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct 2280 attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt 2340 gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc 2400 tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt 2460 agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg 2520 gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg 2580 actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct 2640 tgcccggcgt caacacggga taataccgcg ccacatagca gaactttaaa agtgctcatc 2700 attggaaaac gctcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt 2760 tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt 2820 tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg 2880 aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta tcagggttat 2940 tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg 3000 cgcacatttc cccgaaaagt gccacctggc ggccgcataa gatgatcttc ttgagatcgt 3060 tttggtctgc gcgtaatctc ttgctctgaa aacgaaaaaa ccgccttgca gggcggtttt 3120 tcgaaggttc tctgagctac caactctttg aaccgaggta actggcttgg aggagcgcag 3180 tcaccaaaac ttgtcctttc agtttagcct taaccggcgc atgacttcaa gactaactcc 3240 tctaaatcaa ttaccagtgg ctgctgccag tggtgctttt gcatgtcttt ccgggttgga 3300 ctcaagacga tagttaccgg ataaggcgca gcggtcggac tgaacggggg gttcgtgcat 3360 acagtccagc ttggagcgaa ctgcctaccc ggaactgagt gtcaggcgtg gaatgagaca 3420 aacgcggcca taacagcgga atgacaccgg taaaccgaaa ggcaggaaca ggagagcgca 3480 cgagggagcc gccaggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacca 3540 ctgatttgag cgtcagattt cgtgatgctt gtcagggggg cggagcctat ggaaaaacgg 3600 ctttgccgcg gccctctcac ttccctgtta agtatcttcc tggcatcttc caggaaatct 3660 ccgccccgtt cgtaagccat ttccgctcgc cgcagtcgaa cgaccgagcg tagcgagtca 3720 gtgagcgagg aagcggaata tatcctgtat cacatattct gctgacgcac cggtgcagcc 3780 ttttttctcc tgccacatga agcacttcac tgacaccctc atcagtgcca acatagtaag 3840 ccagtgcggc cgcttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc 3900 ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag 3960 cgaggaagcg gaagagcgcc tgatgcggta ttttctcctt acgcatctgt gcggtatttc 4020 acaccgcata agatcccctc gacctgcagg catgcaagct tctgtaggtt tttaggcata 4080 aaactatatg atttacccct aaatctttaa aatgcccctt aaaattcaaa ataaaggcat 4140 ttaaaattta aatatttctt gtgataaagt ttgttaaaaa ggagtggttt tatgactgtt 4200 atgtggttat cgattatagg tatgtggttt tgtattggaa tggcattttt tgctatcaag 4260 gttattaaaa ataaaaatta gaccacgcat ttatgccgag aaaatttatt gtgcgttgag 4320 aagaaccctt aactaaactt gcagacgaat gtcggcatag cgtgagctat taagccgacc 4380 attcgacaag ttttgggatt gttaagggtt ccgaggctca acgtcaataa agcaattgga 4440 ataaagaagc gaaaaaggag aagtcggttc agaaaaagaa ggatatggat ctggagctgt 4500 aatataaaaa ccttcttcaa ctaacggggc aggttagtga cattagaaaa ccgactgtaa 4560 aaagtacagt cggcattatc tcatattata aaagccagtc attaggccta tctgacaatt 4620 cctgaataga gttcataaac aatcctgcat gataaccatc acaaacagaa tgatgtacct 4680 gtaaagatag cggtaaatat attgaattac ctttattaat gaattttcct gctgtaataa 4740 tgggtagaag gtaattacta ttattattga tatttaagtt aaacccagta aatgaagtcc 4800 atggaataat agaaagagaa aaagcatttt caggtatagg tgttttggga aacaatttcc 4860 ccgaaccatt atatttctct acatcagaaa ggtataaatc ataaaactct ttgaagtcat 4920 tctttacagg agtccaaata ccagagaatg ttttagatac accatcaaaa attgtataaa 4980 gtggctctaa cttatcccaa taacctaact ctccgtcgct attgtaacca gttctaaaag 5040 ctgtatttga gtttatcacc cttgtcacta agaaaataaa tgcagggtaa aatttatatc 5100 cttcttgttt tatgtttcgg tataaaacac taatatcaat ttctgtggtt atactaaaag 5160 tcgtttgttg gttcaaataa tgattaaata tctcttttct cttccaattg tctaaatcaa 5220 ttttattaaa gttcatttga tatgcctcct aaatttttat ctaaagtgaa tttaggaggc 5280 ttacttgtct gctttcttca ttagaatcaa tcctttttta aaagtcaata ttactgtaac 5340 ataaatatat attttaaaaa tatcccactt tatccaattt tcgtttgttg aactaatggg 5400 tgctttagtt gaagaataaa gaccacatta aaaaatgtgg tcttttgtgt ttttttaaag 5460 gatttgagcg tagcgaaaaa tccttttctt tcttatcttg ataataaggg taactattgc 5520 cggcgaggct agttaccctt aagttattgg tatgactggt tttaagcgca aaaaaagttg 5580 ctttttcgta cctattaatg tatcgtttta aatgactagt aaaaaacata catagaaagg 5640 ggaaaaagca acttttttta ttgtcatagt ttgtgaaaac taagttgttt ttatgtgtta 5700 taacatggaa aagtatactg agaaaaaaca aagaaatcaa gtatttcaga aatttattaa 5760 acgtcatatt ggagagaatc aaatggattt agttgaagat tgcaatacat ttctgtcttt 5820 tgtagctgat aaaactttag aaaaacagaa attatataaa gctaattctt gtaaaaatcg 5880 attttgtcct gtctgtgctt ggagaaaagc tagaaaagat gcattgggtt tatctttgat 5940 gatgcaatat attaagcagc aagagaaaaa ggagtttatc tttttaactt tgactacacc 6000 taatgtaatg agtgatgaat tagaaaatga aataaaacgt tataataatt cttttagaaa 6060 acttataaag agaaaaaaag taggtagtgt tataaaggga tatgttcgta agttagagat 6120 tacatataat aaaaaaagag atgattataa tcctcatttt catgtgttaa ttgcagtaaa 6180 taaatcgtat ttcacagata aaagatatta tattagccaa caagaatggt tagatttatg 6240 gcgtgatgta acgggcattt cagaaataac acaagttcaa gttcaaaaaa taagacaaaa 6300 taataataaa gaattatatg aaatggctaa gtattctggt aaagatagtg attatttaat 6360 aaataaatca aaaagtcttt gatgcatttt ataaatcact taaaggtaaa caggtattag 6420 tttattcagg attatttaaa gaggctaaaa agaaattaaa aaatggggat ttagattact 6480 taaaagaaat tgatccaacc gtaatatatc tatcaaattt tttatatttg gaaacaaaaa 6540 gagtatttag ctagtgaact ttatgactta acagaacaag aaaaaagaga aattaatcac 6600 aaaatgatag acgaaatcga ggaagaacaa taacaaaata taagtgctaa cagtcgtctg 6660 caagtttagt taagggttct tctcaacgca caataaattt tctcggcata aatgcgtggt 6720 ctaattttta tttttaataa ccttgatagc aaaaaatgcc attccaatac aaaaccacat 6780 acctataatc gataaccaca taacagtcat aaaaccactc ctttttaaca aactttatca 6840 caagaaatat tt 6852 54 16 DNA Artificial Sequence OR1 operator sequence 54 tatcaccgca gaggta 16 55 17 DNA Artificial Sequence OR2 operator sequence 55 taacaccgtg cgtgttg 17 56 17 DNA Artificial Sequence OR3 operator sequence 56 tatcaccgca agggata 17 57 17 DNA Artificial Sequence OL1 operator sequence 57 tatcaccgcc agtggta 17 58 17 DNA Artificial Sequence OL2 operator sequence 58 tatctctggc ggtgttg 17 59 17 DNA Artificial Sequence OL3 operator sequence 59 tatcaccgca gatggtt 17 60 15 DNA Artificial Sequence MalM operator sequence 60 cgcaaacgtt tgcgt 15 61 14 DNA Artificial Sequence MalX operator sequence 61 cgcaaacgtt ttgg 14 62 38 DNA Escherichia coli 62 actttatcac tgataaacaa acttatcagt gataaaga 38 63 38 DNA Escherichia coli 63 actctatcat tgatagagtt ccctatcagt gatagaga 38 64 38 DNA Plasmid pSC101 64 agcttatcat cgataagcta gtttatcaca gttaaatt 38 65 39 DNA Salmonella 65 actctatcat tgataaggga actctatcaa tgataggga 39 66 38 DNA Transposon Tn10 66 aatctatcac tgatagagta ccctatcatc gatagaga 38 67 20 DNA Artificial Sequence Trp O operator sequence 67 cgaactagtt aactagtacg 20 68 29 DNA Artificial Sequence Xyl O operator sequence 68 agttagtttg tttattaaat taaccaact 29 

What is claimed is:
 1. An isolated nucleic acid comprising a fusion promoter said fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter.
 2. The isolated fusion promoter of claim 1, wherein said at least one promoter is selected from the group consisting of SEQ ID NOs.: 36-45.
 3. The isolated nucleic acid of claim 1, wherein said at least one operator is xylO.
 4. The isolated nucleic acid of claim 3, wherein said at least one promoter is T5.
 5. The isolated nucleic acid of claim 3, further comprising a second operator.
 6. The isolated nucleic acid of claim 5, wherein said second operator is lacO.
 7. The isolated nucleic acid of claim 1, wherein said fusion promoter is responsive to an inducer.
 8. The isolated nucleic acid of claim 7, wherein said inducer is xylose.
 9. The isolated nucleic acid of claim 1, wherein said fusion promoter is titratable.
 10. The isolated nucleic acid of claim 1, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.
 11. The isolated nucleic acid of claim 1, wherein said at least one gram-positive organism is Staphylococcus aureus.
 12. The isolated nucleic acid of claim 1, wherein said at least one gram-positive organism is Enterococcus faecalis.
 13. An isolated nucleic acid comprising a fusion promoter said fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter.
 14. The isolated nucleic acid of claim 13, wherein said at least one operator is xylO.
 15. The isolated nucleic acid of claim 14, wherein said at least one promoter is T5.
 16. The isolated nucleic acid of claim 14, further comprising a second operator.
 17. The isolated nucleic acid of claim 16, wherein said second operator is lacO.
 18. The isolated nucleic acid of claim 13, wherein said fusion promoter is responsive to an inducer.
 19. The isolated nucleic acid of claim 18, wherein said inducer is xylose.
 20. The isolated nucleic acid of claim 13, wherein said fusion promoter is titratable.
 21. The isolated nucleic acid of claim 13, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.
 22. The isolated nucleic acid of claim 13, wherein said at least one gram-positive organism is Staphylococcus aureus.
 23. The isolated nucleic acid of claim 13, wherein said at least one gram-positive organism is Enterococcus faecalis.
 24. An isolated fusion promoter comprising one of SEQ ID NO.: 26-35.
 25. A vector comprising the isolated nucleic acid of claim
 1. 26. A vector comprising the isolated nucleic acid of claim
 13. 27. A vector comprising the isolated nucleic acid of claim
 24. 28. The vector of claim 25 further comprising at least one replicon selected from the group consisting of p15a, pC194 and pCT1138.
 29. The vector of claim 25 further comprising a reporter gene operably linked to said fusion promoter.
 30. The vector of claim 29, wherein said reporter gene is lacL-lacM.
 31. The vector of claim 30, wherein lacL-lacM is derived from Leuconostoc mesenteroides.
 32. The vector of claim 25, wherein said at least one operator is xylO.
 33. The vector of claim 32, wherein said at least one promoter is T5.
 34. The vector of claim 32, further comprising a second operator.
 35. The vector of claim 34, wherein said second operator is lacO.
 36. The vector of claim 25, wherein said fusion promoter is responsive to an inducer.
 37. The vector of claim 36, wherein said inducer is xylose.
 38. The vector of claim 25, wherein said fusion promoter is titratable.
 39. The vector of claim 25, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.
 40. The vector of claim 25, wherein said at least one gram-positive organism is Staphylococcus aureus.
 41. The vector of claim 25, wherein said at least one gram-positive organism is Enterococcus faecalis.
 42. The vector of claim 25, further comprising a random fragment of a microbial genome operably linked to said fusion promoter.
 43. The vector of claim 25, further comprising a nucleic acid that encodes a peptide, wherein said nucleic acid is operably linked to said fusion promoter.
 44. The vector of claim 25, further comprising a nucleic acid that is complementary to a portion of a microbial genome, wherein said nucleic acid is operably linked to said fusion promoter.
 45. The vector of claim 25, further comprising a nucleic acid that encodes a molecule that inhibits the proliferation of microbe, wherein said nucleic acid is operably linked to said fusion promoter.
 46. A host cell comprising the nucleic acid of claim
 1. 47. A host cell comprising the nucleic acid of claim 13
 48. A method for identifying genes involved in cellular proliferation said method comprising the steps of: (a) introducing into cells of a cell population a construct comprising an inducible fusion promoter operably linked to a nucleic acid, said fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; (b) inducing transcription of said nucleic acid from said inducible fusion promoter; (c) identifying the cells in said cell population whose proliferation is reduced in response to the induction of transcription of said nucleic acid; and (d) identifying the gene from a cell identified in step (c) to which at least a portion of said nucleic acid is complementary.
 49. A method for identifying genes involved in cellular proliferation said method comprising the steps of: (a) introducing into cells of a cell population a construct comprising an inducible fusion promoter operably linked to a nucleic acid, said fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; (b) inducing transcription of said nucleic acid from said inducible fusion promoter; (c) identifying the cells in said cell population whose proliferation is reduced in response to the induction of transcription of said nucleic acid; and (d) identifying the gene from a cell identified in step (c) to which at least a portion of said nucleic acid is complementary.
 50. The method of claim 48, wherein said nucleic acid is random fragment of a microbial genome.
 51. The method of claim 48, wherein said nucleic acid encodes an aptamer.
 52. The method of claim 48, wherein said nucleic acid encodes a transcript that is complementary to a portion of a microbial genome.
 53. The method of claim 48, wherein said nucleic acid encodes a molecule that inhibits the proliferation of a microbe.
 54. The method of claim 48, wherein said at least one operator is xylO.
 55. The method of claim 54, wherein said fusion promoter is T5.
 56. The method of claim 54, wherein said fusion promoter further comprises a second operator.
 57. The method of claim 56, wherein said second operator is lacO.
 58. The method of claim 54, wherein said inducer is xylose.
 59. The method of claim 54, wherein said fusion promoter is titratable.
 60. The method of claim 48, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.
 61. The method of claim 48, wherein said at least one gram-positive organism is Staphylococcus aureus.
 62. The method of claim 48, wherein said at least one gram-positive organism is Enterococcus faecalis.
 63. A method for identifying genes involved in cellular proliferation said method comprising the steps of: (a) introducing into the genome of a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid, said fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; (b) comparing the proliferation of said cell cultured in the presence of a first concentration of an inducer that induces transcription from said fusion promoter with the proliferation of said cell cultured in the presence of a concentration of said inducer that is less than said first concentration, wherein a difference in proliferation indicates that said fusion promoter modulates a gene that is required for proliferation; and (c) identifying the gene that is modulated by said fusion promoter.
 64. A method for identifying genes involved in cellular proliferation said method comprising the steps of: (a) introducing into the genome of a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid, said fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; (b) comparing the proliferation of said cell cultured in the presence of a first concentration of an inducer that induces transcription from said fusion promoter with the proliferation of said cell cultured in the presence of a concentration of said inducer that is less than said first concentration, wherein a difference in proliferation indicates that said fusion promoter modulates a gene that is required for proliferation; and (c) identifying the gene that is modulated by said fusion promoter.
 65. The method of claim 63, wherein said nucleic acid is random fragment of a microbial genome.
 66. The method of claim 63, wherein said nucleic acid encodes an aptamer.
 67. The method of claim 63, wherein said nucleic acid encodes a transcript that is complementary to a portion of a microbial genome.
 68. The method of claim 63, wherein said nucleic acid encodes a molecule that inhibits the proliferation of a microbe.
 69. The method of claim 63, wherein said at least one operator is xylO.
 70. The method of claim 69, wherein said at least one promoter is T5.
 71. The method of claim 69, wherein said fusion promoter further comprises a second operator.
 72. The method of claim 71 wherein said second operator is lacO.
 73. The method of claim 63, wherein said inducer is xylose.
 74. The method of claim 63, wherein said fusion promoter is titratable.
 75. The method of claim 63, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.
 76. The method of claim 63, wherein said at least one gram-positive organism is Staphylococcus aureus.
 77. The method of claim 63, wherein said at least one gram-positive organism is Enterococcus faecalis.
 78. A method for identifying a compound which reduces the activity or level of a gene product required for proliferation of a cell said method comprising: (a) introducing into a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required gene, said fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; (b) sensitizing said cell by inducing transcription from said fusion promoter; (c) contacting said sensitized cell with a compound; and (d) determining the degree to which said compound inhibits proliferation of said sensitized cell relative to a cell which has not been sensitized.
 79. A method for identifying a compound which reduces the activity or level of a gene product required for proliferation of a cell said method comprising: (a) introducing into a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required gene, said fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; (b) sensitizing said cell by inducing transcription from said fusion promoter; (c) contacting said sensitized cell with a compound; and (d) determining the degree to which said compound inhibits proliferation of said sensitized cell relative to a cell which has not been sensitized.
 80. The method of claim 78, wherein said at least one operator is xylO.
 81. The method of claim 80, wherein said at least one promoter is T5.
 82. The method of claim 80, wherein said fusion promoter further comprises a second operator.
 83. The method of claim 82, wherein said second operator is lacO.
 84. The method of claim 78, wherein said inducer is xylose.
 85. The method of claim 78, wherein said fusion promoter is titratable.
 86. The method of claim 78, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.
 87. The method of claim 78, wherein said at least one gram-positive organism is Staphylococcus aureus.
 88. The method of claim 78, wherein said at least one gram-positive organism is Enterococcus faecalis.
 89. A compound identified using the method of claim
 78. 90. A method for inhibiting the activity or expression of a gene in an operon required for proliferation said method comprising: (a) introducing into a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required operon, said fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; and (b) inducing transcription from said fusion promoter.
 91. A method for inhibiting the activity or expression of a gene in an operon required for proliferation said method comprising: (a) introducing into a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required operon, said fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; and (b) inducing transcription from said fusion promoter.
 92. The method of claim 90, wherein said at least one operator is xylO.
 93. The method of claim 92, wherein said at least one promoter is T5.
 94. The method of claim 92, wherein said fusion promoter further comprises a second operator.
 95. The method of claim 94, wherein said second operator is lacO.
 96. The method of claim 90, wherein said inducer is xylose.
 97. The method of claim 90, wherein said fusion promoter is titratable.
 98. The method of claim 90, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.
 99. The method of claim 90, wherein said at least one gram-positive organism is Staphylococcus aureus.
 100. The method of claim 90, wherein said at least one gram-positive organism is Enterococcus faecalis.
 101. A method of manufacturing an antibiotic comprising the steps of: (a) introducing into a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required gene, said fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; (b) sensitizing said cell by inducing transcription from said fusion promoter; (c) contacting said sensitized cell with a compound; (d) identifying a compound which substantially inhibits the proliferation of said sensitized cell relative to a cell which has not been sensitized; and (e) manufacturing the compound so identified.
 102. A method of manufacturing an antibiotic comprising the steps of: (a) introducing into a cell a construct comprising an inducible fusion promoter operably linked to a nucleic acid that is complementary to at least a portion of a proliferation-required gene, said fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; (b) sensitizing said cell by inducing transcription from said fusion promoter; (c) contacting said sensitized cell with a compound; (d) identifying a compound which substantially inhibits the proliferation of said sensitized cell relative to a cell which has not been sensitized; and (e) manufacturing the compound so identified.
 103. The method of claim 101, wherein said at least one operator is xylO.
 104. The method of claim 103, wherein said at least one promoter is T5.
 105. The method of claim 103, wherein said fusion promoter further comprises a second operator.
 106. The method of claim 105, wherein said second operator is lacO.
 107. The method of claim 101, wherein said inducer is xylose.
 108. The method of claim 101, wherein said fusion promoter is titratable.
 109. The method of claim 101, wherein said at least one gram-positive organism is selected from the group consisting of Bacillus anthracis, Clostridium botulinum, Clostridium difficile, Clostridium perfringens, Clostridium teteni, Corynebacterium diptheriae, Enterococcus faecalis, Enterococcus faecium, Lactococcus lactis, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Nocardia asteroides, Staphylococcus aureus, Staphylococcus epidermidis, Staphylococcus xylosis, Streptococcus pneumoniae, Streptococcus mutans and any species falling within the genera of any of the above species.
 110. The method of claim 101, wherein said at least one gram-positive organism is Staphylococcus aureus.
 111. The method of claim 101, wherein said at least one gram-positive organism is Enterococcusfaecalis.
 112. A method for identifying a gene which is required for proliferation of a prokaryotic cell said method comprising the steps of: (a) replacing the native promoter of a gene in the chromosome of a prokaryotic cell having an enhanced frequency of homologous recombination with a regulatable fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO, lacO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; and (b) identifying cells in which the extent of proliferation of said cell when said fusion promoter is active at a first level is substantially different than the extent of proliferation of said cell when said fusion promoter is active at a second level, said first level being greater than said second level.
 113. A method for identifying a gene which is required for proliferation of a prokaryotic cell said method comprising the steps of: (a) replacing the native promoter of a gene in the chromosome of a prokaryotic cell having an enhanced frequency of homologous recombination with a regulatable fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO, lacO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; and (b) identifying cells in which the extent of proliferation of said cell when said fusion promoter is active at a first level is substantially different than the extent of proliferation of said cell when said fusion promoter is active at a second level, said first level being greater than said second level.
 114. The method of claim 112, wherein said fusion promoter is an inducible promoter.
 115. The method of claim 112, wherein the step of replacing said native promoter comprises introducing a linear nucleic acid comprising a 5′ portion homologous to a first portion of said native promoter, a 3′ portion homologous to a second portion of said native promoter and said fusion promoter disposed between said 5′ portion and said 3′ portion into said cell such that homologous recombination occurs between said 5′ portion and said first portion of said native promoter and between said 3′ portion and said second portion of said native promoter.
 116. The method of claim 115, wherein said linear nucleic acid is double stranded.
 117. The method of claim 115, wherein said linear nucleic acid is single stranded.
 118. A method for identifying a compound which inhibits the proliferation of a prokaryotic cell said method comprising the steps of: (a) replacing the native promoter of a gene in the chromosome of a prokaryotic cell having an enhanced frequency of homologous recombination with a regulatable fusion promoter comprising at least one promoter that is modified to have altered activity in at least one gram-positive organism said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO, lacO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; and (b) comparing the extent of proliferation of a first sample of said cell in the presence of said compound to the extent of proliferation of a second sample of said cell in the presence of said compound, wherein said first sample of said cell has a reduced activity of said fusion promoter activity relative the activity of said fusion promoter in said second sample of said cell and wherein said compound inhibits the proliferation of said cell if the extent of proliferation of said first sample of said cell is substantially less than the extent of proliferation of said second sample of said cell.
 119. A method for identifying a compound which inhibits the proliferation of a prokaryotic cell said method comprising the steps of: (a) replacing the native promoter of a gene in the chromosome of a prokaryotic cell having an enhanced frequency of homologous recombination with a regulatable fusion promoter comprising at least one promoter selected from the group consisting of T5, CP25, P32, P59, P1P2 and PL, said promoter being linked to at least one operator selected from the group consisting of xylO, tetO, trpO, malO, lacO and λc1O, wherein said at least one operator is positioned such that binding of at least one repressor to said at least one operator represses transcription from said fusion promoter; and (b) comparing the extent of proliferation of a first sample of said cell in the presence of said compound to the extent of proliferation of a second sample of said cell in the presence of said compound, wherein said first sample of said cell has a reduced activity of said fusion promoter activity relative the activity of said fusion promoter in said second sample of said cell and wherein said compound inhibits the proliferation of said cell if the extent of proliferation of said first sample of said cell is substantially less than the extent of proliferation of said second sample of said cell.
 120. The method of claim 118, wherein the step of replacing said native promoter comprises introducing a linear nucleic acid comprising a 5′ portion homologous to a first portion of said native promoter, a 3′ portion homologous to a second portion of said native promoter and said fusion promoter disposed between said 5′ portion and said 3′ portion into said cell such that homologous recombination occurs between said 5′ portion and said first portion of said native promoter and between said 3′ portion and said second portion of said native promoter.
 121. The method of claim 120, wherein said linear nucleic acid is double stranded.
 122. The method of claim 120, wherein said linear nucleic acid is single stranded.
 123. A method for identifying a gene which is required for proliferation of a prokaryotic cell said method comprising the steps of: (a) introducing at least one operator selected from the group consisting of xylO, tetO, trpO, malO, lacO and λc1O into a prokaryotic cell having an enhanced frequency of homologous recombination such that said at least one operator regulates transcription of a target nucleic acid in the chromosome of said cell; and (b) identifying cells in which the extent of proliferation of said cell when said target nucleic acid is transcribed at a first level is substantially different than the extent of proliferation of said cell when said target nucleic acid is transcribed at a second level, said first level being greater than said second level.
 124. The method of claim 123, wherein the step of introducing said at least one operator comprises introducing a linear nucleic acid comprising a 5′ portion homologous to a first portion of the chromosome of said cell, a 3′ portion homologous to a second portion of the chromosome of said cell and said at least one operator disposed between said 5′ portion and said 3′ portion into said cell such that homologous recombination occurs between the 5′ portion and said first portion of the chromosome of said cell and between the 3′ portion and said second portion of the chromosome of said cell.
 125. The method of claim 124, wherein said linear nucleic acid is double stranded.
 126. The method of claim 124, wherein said linear nucleic acid is single stranded.
 127. A method of identifying a compound which inhibits the proliferation of a prokaryotic cell said method comprising the steps of: (a) obtaining a prokaryotic cell in which transcription of a nucleic acid required for proliferation of said cell is regulated by at least one operator which has been introduced into the chromosome of said cell said at least one operator selected from the group consisting of xylO, tetO, trpO, malO, lacO and λc1O; (b) sensitizing said cell by growing said cell under conditions in which the level of transcription of said gene is lower than that of a wild type cell; (c) contacting said sensitized cell with said compound; and (d) determining the degree to which said compound inhibits the growth of said sensitized cell relative to an unsensitized cell.
 128. The method of claim 127, wherein said cell has an enhanced frequency of homologous recombination.
 129. The method of claim 128, wherein said linear nucleic acid is single stranded.
 130. The method of claim 128, wherein said linear nucleic acid is double stranded.
 131. A method of identifying a nucleic acid sequence having promoter activity in Enterococcus faecalis said method comprising the steps of: (a) inserting a candidate nucleic acid into a vector comprising lacL-lacM reporter genes such that said candidate nucleic acid is upstream of the lacL-lacM reporter genes; (b) introducing said vector comprising said candidate nucleic acid into Enterococcus faecalis; (c) detecting expression of the lacL-lacM reporter genes, wherein expression of said lacL-lacM reporter genes indicates that said candidate nucleic acid sequence has promoter activity.
 132. The method of claim 131, wherein said detecting step comprises measuring β-galactosidase activity.
 133. The method of claim 131, wherein said candidate nucleic acid is a promoter modified to increase activity in a gram-positive organism.
 134. The method of claim 131, wherein said lacL-lacM reporter gene is derived from Leuconostoc mesenteroides.
 135. The method of claim 131, wherein said vector is pEPEF1. 